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Preface 


This book was written as a text, although many may consider it a mono- 
graph. 

As a text it has been used several times in both the one-year graduate 
quantum-mechanics course and (in its shortened version) in a senior quantum 
mechanics course that I taught at the University of Texas at Austin. It is 
self-contained and does not require any prior knowledge of quantum 
mechanics. It also introduces the mathematical language of quantum 
mechanics, starting with the definitions, and attempts to teach this language 
by using it. Therefore, it can, in principle, be read without prior knowledge 
of the theory of linear operators and linear spaces, though some familiarity 
with linear algebra would be helpful. Prerequisites are knowledge of calculus 
and of vector algebra and analysis. Also used in a few places are some 
elementary facts of Fourier analysis and differential equations. Most physical 
examples are taken from the fields of atomic and molecular physics, as it is 
these fields that are best known to students at the stage when they learn 
quantum mechanics. 

This book may be considered a monograph because the presentation here 
is different from the usual treatment in many standard textbooks on quantum 
mechanics. It is not that a ‘‘different kind” of quantum mechanics is pre- 
sented here; this is conventional quantum mechanics (“Copenhagen inter- 
pretation’’). However, in contrast to what one finds in the standard books, 
quantum mechanics is more than the overemphasized wave-particle dualism 
presented in the familiar mathematics of differential equations. “This latter 
dualism is only part of a more general pluralism” (Wigner) because, besides 
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momentum and position, there is a plurality of other observables not 
commuting with position and momentum. As there is no principle that 
brings into prominence the position and momentum operators, a general 
formalism of quantum mechanics, in which every observable receives the 
emphasis it deserves for the particular problem being considered, is not only 
preferable but often much more practical. The lesson that I believe can be 
learned from the situation in particle theory is that more is needed than just 
the solutions of differential equations and there exist algebraic relations 
other than just the canonical commutation relation. In atomic and molecular 
physics the use of these general algebraic methods of quantum mechanics 
may be merely of practical advantage but not necessary, but in particle 
physics they seem to be essential. It is this general form of quantum theory 
that is presented here. 

I have attempted to present the whole range from the fundamental assump- 
tions to the experimental numbers. To do this in the limited space available 
required compromises. My choice may, to a certain extent, reflect my 
personal taste. But it was mainly influenced by what I thought was needed for 
modern physics and by what I found, or did not find, in the standard text- 
books. Detailed discussions of the Schrédinger differential equation for the 
hydrogen atom and other potentials can be found in many good books.’ 
On the other hand, the description of the vibrational and rotational spectra 
of molecules are hardly treated in any textbooks of quantum mechanics, 
though they serve as simple examples for the important procedure of 
quantum-mechanical model building. Also, ‘elementary particles are much 
more similar to molecules than to atoms” (Heisenberg). So I have treated 
the former rather briefly and devoted considerable space to the latter. 

Groups have not been explicitly made use of in this book. However, the 
reader familiar with this subject will see that group theory is behind most of 
the statements that have been cast here in terms of algebras of observables. 

This is a physics book, and though mathematics has been used extensively, 
I have not endeavored to make the presentation mathematically rigorous. 
Most of the mathematics I believe to be rigorously justifiable within the 
framework of the rigged Hilbert space, which —in contrast to von Neumann’s 
formation but in accord with Dirac’s presentation? —is assumed to be the 
underlying mathematical structure. Except in the mathematical inserts, 
which are given in open brackets [M: _ |, the reader will not even be made 
aware of these mathematical details. 

The mathematical inserts are of two kinds. The first kind provides the 
mathematics needed, and the second kind indicates the underlying mathe- 
matical justification. The whole first chapter is a mathematical insert of the 
first kind. As presented here, the mathematics can only be appreciated in its 
applications. This suggests the pedagogical advice that the reader should not 
attempt to read the book in a linear fashion, one fully understood page after 
another, but that he should be content to obtain a superficial under- 


' It has, usually, also been adequately treated in an undergraduate course. 
* See, for instance, Dirac (1958), Jauch (1958), Ludwig (1954), and von Neumann (1932). 
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standing of a subject at first reading and then return to it later for a deeper 
understanding. 

Quantum mechanics starts with Chapter II, where the most essential basic 
assumptions (axioms) of quantum mechanics are made plausible from the 
example of the harmonic oscillator as realized by the diatomic molecule. 
Further basic assumptions are introduced in later chapters when the scope 
of the theory is extended. These basic assumptions are not to be understood 
as mathematical axioms from which everything can be derived without using 
further judgment and creativity. An axiomatic approach of this kind does 
not appear to be possible in physics. The basic assumptions are to be con- 
sidered as a concise way of formulating the quintessence of many experimental 
facts. 

The book consists of two clearly distinct parts, Chapters II-X] and Chap- 
ters XIV-XXI, with two intermediate chapters, Chapters XII and XIII. 
The first part is more elementary in presentation, though more fundamental 
in subject matter, and gives a more approximate description because it 
treats all systems as stationary. Chapter XII introduces the basic assumption 
of time development. Chapter XIII is just an application of previously 
developed concepts and attempts to illustrate the characteristic features in 
which quantum mechanics differs from the classical theories. The second 
part, which starts with Chapter XIV, treats scattering and decaying systems. 
The presentation there is much more advanced. 

Chapter XIV gives a derivation of the cross section under very general 
conditions, and Section XIV.5 may be the most difficult section of the book. 
Section XIV.5 and Chapter XV may be omitted in first reading if the reader 
can accept the results without derivation. Starting with Chapter XVI, 
applications are restricted to rationally symmetric systems without spin. 
Two different points of view—one in which the Hamiltonian time develop- 
ment is assumed to exist, and the other making use of the S-matrix—are 
treated in a parallel fashion. The required analyticity of the S-matrix is 
deduced from causality. One of the main features of the presentation is to 
treat discrete and continuous spectra from the same point of view. For this 
the rigged Hilbert space is needed, which provides not only a mathematical 
simplification but also a description which is closer to physics. In the last 
chapter, the rigged Hilbert space is used to describe a decaying system by 
eigenvectors of the energy operator with complex energy. This establishes 
the link between the S-matrix description of a resonance at a pole and the 
usual description of states by vectors in a linear space, and is another 
example of the advantages that the rigged Hilbert space provides. 

Written at the height of the atomistic point of view— this book tries to 
expose also the complementary way of understanding, the holistic view. 
(See also Heisenberg’s last lecture published in Physics Today 29, 32 (1976).} 
Though never mentioned explicitly except in the brief Epilogue, the duality 
between atomism and holism is the recurring theme throughout the book. 
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Mathematical Preliminaries 


The mathematical language of quantum mechanics is introduced in this 
chapter. It does not contain much more than the basic definitions. A proper 
understanding of this material will come through its application in the follow- 
ing chapters. 


I.1 The Mathematical Language of Quantum Mechanics 


To formulate Newtonian mechanics, the mathematical language of dif- 
ferential and integral calculus was developed. Though one can get some kind 
of understanding of velocity, acceleration, etc., without differential calculus 
(in particular for special cases), the real meanings of these physical notions 
in their full generality become clear only after one is familiar with the idea 
of the derivative. On the other hand, though, the abstract mathematical 
definitions of calculus become familiar to us only if we visualize them in 
terms of their physical realizations. Nowadays, no one would attempt to 
understand classical mechanics without knowing calculus. 

Quantum mechanics, too, has its mathematical language, whose develop- 
ment went parallel to the development of quantum mechanics and whose 
creation in its full generality was inspired by the needs of quantum physics. 
This is the language of linear spaces, linear operators, associative algebras, 
etc., which has meanwhile grown into one of the main branches of mathe- 
matics—linear algebra and functional analysis. Although one might obtain 
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some sort of understanding of quantum physics without knowing its mathe- 
matical language, the precise and deep meaning of the physical notions in 
their full generality will not reveal themselves to anyone who does not under- 
stand its mathematical language. 

Therefore I shall start the quantum-mechanics course with some of the 
vocabulary and grammar of this language. I shall not try to be mathemati- 
cally rigorous, since one can still communicate in a language that one does 
not speak completely correctly. I shall also not give all the mathematics 
that is needed at the beginning, and you need not be worried if you do not 
understand everything right away; one learns a language best by using it. I 
shall give at the beginning only as much mathematics as is necessary to make 
the initial statements about physics. We shall then have to learn new mathe- 
matical notions whenever they arise, while we proceed with the development 
of the physical ideas. 

Before we start to study the mathematical structures that are employed 
in quantum mechanics, we should make the following observation: A 
mathematical structure is not something real—it only exists in our mind 
and is created by our mind (though often inspired by outside influences). It 
is obtained by taking a set of objects and equipping this set with a structure, 
by defining relations between these objects. Modern mathematics dis- 
tinguishes three basic kinds of structures: algebraic, topological, and 
ordering. The mathematical structures we use are complicated combinations 
of these three. For example, the real numbers have an algebraic structure 
given by the usual laws of addition and multiplication, they have a topo- 
logical structure given by the meaning of the usual limiting process for an 
infinite series of numbers, and they have an ordering structure given by the 
relations expressed by <. 

We shall use predominantly algebraic structures, although in order to 
speak the mathematical language of quantum mechanics correctly, topo- 
logical structures are essential. We shall start with the definition of a linear 
space, then introduce linear operators and give the definition of an associative 
algebra. That will provide us with enough vocabulary and grammar to enable 
us to start communicating physics, in the process of which the meaning of 
these mathematical structures will be filled with substance. 


1.2 Linear Spaces, Scalar Product 


A set R of elements @, W, x,... 1s called a linear space if the sum ¢@ + w of 
any two elements @, eR and the product a@ of any element eR with 
any complex number a are defined and have the following properties: 


if ¢, WER, then d + WER. (2.1a) 
o+y=v+¢. (2.1b) 
(@+wW+x=O+ (4H. (2.1c) 
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There exists in Ra “zero” element Osuch that @ + 0 = dforallg@eR. (2.1d) 


if PER, then ade R. (2.1e) 
a(bd) = (ab)d. (2.1f) 
1-¢ = 9. (2.1g) 

0-¢ = 0 (the number zero appears on the left, the zero element on 
the right), (2.1h) 
a(d + W) = ad + ay, (2.1i) 
(a + b)d = ad + be. (2.1)) 


The element (— 1)¢@ is then usually denoted by — ¢; by properties 2.1g, k, h), 
d+ (—¢) = (1 + (—1))¢ = 06 = 0. 

The elements ¢, w, y of the space R are called vectors. A set M in the linear 

space R is called a subspace of R if M is a linear space under the same defini- 

tions of the operations of addition and multiplication by a number as given 

for R, i.e., if it follows from ¢, we M that abe M and @ + weM. 

An expression of the form a,@, + a,@, + --: + a,@, 1s called a linear 
combination of the vectors ¢,, 62,...,,; the vectors $,, $2,...,, are 
said to be linearly dependent if there exist numbers a,,a),...,a,, not 
all zero, for which a,¢, + a,¢, + --: + a,0, = 0. If the equation a,@, + 
a,o,+--:+a,¢, =9 holds only for ay = a, =--: =a, = 0, then the 
vectors ~;,@2,..., @, are called linearly independent. A space R is said 
to be finite-dimensional and, more precisely, n-dimensional if there are n 
and not more than n linearly independent vectors in R. If the number of 
linearly independent vectors in R is arbitrarily great, then R is said to be 
infinite-dimensional. Every system of n linearly independent vectors in an 
n-dimensional space R is called a basis for R. 

Every vector ¢ of an n-dimensional space R can be uniquely represented 


in the form @ = a,e, +-:- + a,e,, where e;, e2,...,, is a basis for R. 
The numbers a,,..., a, are called the coordinates of the vector ¢ relative to 
therbasiste,,...,e,. 


Evidently, when vectors are added, their corresponding coordinates 
relative to a fixed basis are added, and when a vector is multiplied by any 
number, all the coordinates are multiplied by that number. 

A linear space is called a scalar product space (or Euclidean space), if in it a 
function (¢, w) is defined, having the following properties: 


(¢,¢)>0 and (¢,¢)=0 iff d =0. (2.2a) 
(b, 6) = (9) (2.2b) 

(the bar denotes complex conjugate). 
(, a) = al, W) (2.2c) 


(a€ C, the set of complex numbers). 


(f, + 2, W) = (61, W) + (2, W). (2.2d) 
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This function is called the scalar product of the elements ¢ and w. Such a 
scalar product can be introduced in every finite-dimensional space; for 


example, if e,,...,e, is a basis for R and ¢ = aye, +--+ a,e,, W = 
b,e, + --- + b,e,, then, putting 
(?, W) = a,b, miretinc te G,Dy, (23) 


we get a function (@, w) satisfying the conditions (2.2). 
With the scalar product one defines the norm ||@|| of a vector ¢ € R by 


loll = J (4, 9). (2.4) 


A complex-valued function h(@, ) of two vector arguments is a Hermitian 
form if it satisfies 


h(¢, ) = hy, >), (2.5b) 
h(d, ap) = ah(?,p)  (aeC), (2.5c) 
W(p, + 2, Wh) = W(g,, W) + h(g2, W). (2.5d) 
If in addition h satisfies 
h(o, 6) = 0 (2.5a) 


for every vector ¢, then h is said to be a positive Hermitian form. A positive 
Hermitian form is called positive definite if 


from h(¢,¢)=0 follows @ = 0 for every vector @. (2.6) 


[Since the definition (2.5), (2.6) of a positive definite Hermitian form is 
the same as that of a scalar product, from now on we shall use only the term 
scalar product.] Positive Hermitian forms, which are not necessarily scalar 
products, satisfy the Cauchy—Schwarz-Bunyakovski inequality: 


[h(d, W)|? < h(d, Phy, W). (2.7) 
If h is positive definite, equality holds iff @ = aw for some ce€ C. 

Keeping w fixed, (¢, ¥) = w(@) is a function or functional of @ € R, and 
as this functional fulfills the conditions (2.2c), (2.2d), it is called an antilinear 
functional. In general every function F(@) that fulfills 

F(ad) = aF(¢), (2.8a) 
F($, + 2) = F(¢;) + F(¢2) ($1, $2 € R) (2.8b) 


is called an antilinear functional on the linear space R. 


An antilinear functional is called continuous iff from the convergence 
of the infinite sequence of elements ¢,¢R (n = 1,2,3,...) to the 
element ¢ € R, i.e., from 
?,7>@ asn—> oo, (2.9) 
it follows that 
F(¢,) > F(@) asn— oo. (2.10) 
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This definition of a continuous functional is of course in complete 
analogy to the definition of continuous functions of real and complex 
variables, except that here ¢,,@,,...,@ are not numbers but 
elements of a linear space R. If ¢,, 62, ..., @ are numbers, then the 
meaning of (2.9) is clear; it means |¢, — ¢| +0 as n— oo. For 
elements of a linear space, (2.9) has to be defined; i-e., one has to 
define what one means by the convergence of infinite sequences of 
elements of a linear space. A linear space in which the convergence 
of infinite sequences is defined is called a linear topological space. 
One can give various kinds of definition of convergence in a linear - 
space, i.e., introduce various topologies. For example, in a scalar- 
product space one can define (2.9) by 


od, —@|| ~0 asn- oo. @ hi} 


This definition of topology (which is, however, too narrow for the 
simplest mathematical formulation of quantum mechanics) leads to 
the well-known Hilbert space. 


In a finite-dimensional scalar-product space the scalar product defines an 
antilinear continuous functional and vice versa; for every antilinear con- 
tinuous functional F(@), there exists an element in R, which we shall also 
call F, such that F(¢) = (@, F), i.e., F(@) can be written as a scalar product. 
This is also true for the infinite-dimensional space in which (2.9) is defined by 
(2.11), ie., in the Hilbert space. In general, the set of antilinear continuous 
functionals F on R [1.e., functions F(#) that fulfill (2.8)] is larger than the 
linear topological space R. In analogy to the scalar product we shall denote 
these functions by 


F(¢)=<olF>. F(o) = <FI®>. (2.12) 
(When we are mathematically lax we shall often call them scalar products.) 


A vector ¢ is called normalized if \|@|| = 1. Two vectors ¢, are said to be 
orthogonal to each other if 


(p, p) = 0. 
A set of vectors e,€ R (i = 1, 2,...) is called an orthonormal system if 
(e;, ej) = 6j;- (2,13) 


Every orthonormal system is linearly independent. 
Let dE R; then 


a; = <e;|O> = (4, ) (2.14) 
is called the component or coordinate of ¢ relative to e;. 
The system e,, e,,-.. is called a complete orthonormal system or basis of 


R iff one of the following equivalent statements holds: 
From (e;,¢) = Ofor alli, it follows that = 0; (2.15a) 
d=) efe,,¢) forany PER (2.15b) 
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or, ina different notation, ¢ = ); |e,><e:|>; 
(d, W) = » (9, €;) (e;, W); (2.15c) 


loll? = L1G, eI. (2.15d) 


In quantum mechanics one often uses generalized basis systems. The 
elements e, (x in general any real number) of the generalized basis 
systems for a space R are not vectors of the space R, but are antilinear 
continuous functions on R. For the generalized basis e,, where 
x is an element of a continuous set of real numbers, the following 
statements equivalent to (2.15) hold: 


From <e,|¢> =Oforallx, itfollowsthat @=0; (2.16a) 


b= [dxecelo> = [dxixr<xld>; 166) 
(6.0) = [ dx dle d<enlW> = [dx blx><xIW>s Q.160) 


Io? = | 4x <dles><edl6> = [ ax <b1x><x1d>. (2.16d) 


1.3 Linear Operators, Algebras 


Let R bea linear space. A function A is called an operator in R if it maps each 
vector ¢ in a linear subspace of R into a vector ¥ = A(@) of the same space R. 
Often, when there can be no ambiguity, A@ is written instead of A(@). An 
operator A in R is called linear if 


A(d + W) = Ad + AY, A(ad) = aAd (3.1) 


for all ¢, ye R and all numbers a. Linear operators in a finite-dimensional 
space R can be represented by means of matrices. For this purpose we choose 
a fixed basis e,,...,e, for R. Every vector @€R (and, in particular, the 
vectors Ae,) may be expressed as linear combinations of the elements of the 
basis, according to (2.15): 


Ae, =a oy e(e;, Ae,) — y Cj a jx- (3.2) 
Jeu j=l 
The numbers 
Aj, = (e;, Ae.) = <e;|Ale.> (3.3) 
form a matrix of order n, which is called the matrix of the operator A relative 
to the basis e,,...,@,. For ¢,x€R one can form the scalar product of Ad 


and y:(y, Ad). This is called the matrix element of A between the vectors 
and x. 
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If R is infinite-dimensional, i.e.,n = 00, the matrix of the operator A is an 
infinite matrix. However, in the infinite-dimensional case it may happen that 
Ad for ¢ER does not have any meaning [because, e.g., ||A@||?, which 
according to (2.15) and (3.2) would have to be )°® , (A@, e,)(e;, Ab), may not 
exist, as the infinite sum may not converge]. We shall always assume that 
this never happens, i.e., in the cases we consider A is defined for every 6€ R 
and Ad is again an element of R. For quantum mechanics only operators with 
this property seem to be important (however, not if R is supposed to be the 
Hilbert space). 

It is possible to define the operations of addition of operators, multiplica- 
tion of an operator by a number, and multiplication of operators in R. In 
fact, by A + B, aB, AB we shall understand the operators defined by the 
formulae 


(A + B)p = Ab + Bd, (aA) = aA), (AB) = A(BO), (G4) 


for all ER. It is easily verified that A + B, aA, AB are linear operators if 
A, B are linear operators. In addition, it is easy to verify that the operations 
of addition, multiplication by a number, and multiplication together of 
operators correspond to the addition, multiplication by a number, and 
multiplication of their matrices relative to a fixed basis. 

Particular operators are the zero operator, which we again denote 0 and 
which maps every ¢ € R into the zero vector 0, 0¢ = 0; and the unit operator 
I, which is defined by Id = @ for every GER. 

For every linear operator A defined for all # € R, one can define an operator 
At by (Atd, W) = (¢, A) for every ¢,/ € R. The operator A’ is called the 
adjoint operator of A. An operator for which At = Ais called self-adjoint or 
Hermitian.’ 

If there exists a nonzero vector ¢ such that Ad = Ad (A€ C), then ¢ is called 
an eigenvector of A. The number / is called an eigenvalue of A. 

A set .o& is an (associative) algebra with unit element iff 


(a) -V 1s a linear space. 
(b) For every pair A, Be a, there is defined a product AB € .o such that 


(AB)C = A(BC), 
A(B + C) = AB + AC, 
(A + B)C = AC + BC, 
(aA)B = A(aB) = aAB. 
(c) There exists an element I € 7 such that 
LA Al 24 
for all AE A. 


A subset .o/, of an algebra is called a subalgebra of if .o, is an algebra 
with the same definitions of the operations of addition, multiplication by a 


1 We will usually use the term Hermitian if we do not want to distinguish between the 
mathematically precisely defined notions self-adjoint, essentially self-adjoint, and symmetric. 
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number, and multiplication as given for ~;i.e., if from A, Be W,, it follows 
that A + Be @w,,aAeav%,,andA- Bea. 

An algebra .& is called a *-algebra if we have on the algebra a *-operation 
(involution) A++ A? that has the following defining properties: 


(d) (aA + bB)t = aAt + bBt, 
(AB) = BtAt, 
(AT)t = A, 
r=], 


where A, Be & anda, be C. From the definitions of the sum and product of 
operators with a number—given in (3.4)—and from the definition of the 
formal adjoint operator, one can see that the set of linear operators fulfill all 
the axioms (a), (b), (c), and (d) of a *-algebra. Thus the set of linear operators 
in a linear space forms a *-algebra. A *-subalgebra of this algebra is called an 
operator *-algebra. It can be shown that in a certain sense every *-algebra 
can be realized as an operator *-algebra in a scalar-product space (generaliza- 
tion of the Gelfand—Naimark-—Segal reconstruction theorem). In quantum 
mechanics physical systems are assumed to be described by operator 
*-algebras. 

A set X,, X,,..., X, of elements of .o is called a set of generators, and J 
is said to be generated by the X; (i = 1, 2,..., n) iffeach element of can be 
written 


A=c+) 


=1 


cX, + >) clX,X;+---, (3.5) 
By 
where c, c', cl, ... EC. 
Defining algebraic relations are relations among the generators 
Pes) — 0; (3.6) 
where P(X) is a polynomial with complex coefficients of the n variables X;. 
An element Be «A/, 
B=b+) b'X, + ) bUX,X,+-°:, (3.7) 


where b, b',... € C, is equal to the element A iff (3.7) can be brought into the 
same form (3.5) with the same coefficients c, c', c'’", .. . by the use of the defining 
relations (3.6). 


CHAPIER 


Foundations of Quantum 
Mechanics— The Harmonic 
Oscillator 


This chapter, the longest in the book, introduces three of the basic 
assumptions of quantum mechanics and then illustrates them, using mainly 
the example of the harmonic oscillator. Though some historic remarks are 
included, neither the historic development nor any other heuristic way 
towards quantum mechanics is followed. The basic assumptions are 
formulated, explained, and applied. In Sections II.2, 11.4, 11.5, the basic 
assumption are introduced; in Sections IL.3, I1.8, I1.9, the harmonic oscillator 
is used to illustrate them. Section II.7 contains the derivations of some general 
consequences and might be omitted in first reading. The discussion for the 
continuous spectra, important for the description of the scattering and 
decay phenomena in the second part of the book, is given in Section II.10. 
Several remarks throughout this chapter emphasize the particular problems 
connected with generalized eigenvalues and eigenvectors and our unified 
treatment of continuous and discrete spectra. In Section II.11 we are ready 
to explain the physical meaning of the quantum-mechanical constant of 
nature, h. 


II.1 Introduction 


Physicists believe that there is something in nature, or in each restricted 
domain of it, that may be “understood”; that there is a structure in nature. 
To “understand” means to bring this structure into congruence with some 
structure in our mind, with a structure of thought objects, with a structure 
that has been created by our minds. For physics this structure of thought 

y) 


10 Foundations of Quantum Mechanics—The Harmonic Oscillator 


objects is a mathematical structure. So to understand part of physical nature 
means to map its structure on a mathematical structure. To obtain a physical 
theory, then, means to obtain a mathematical image of a physical system, by 
which we mean any suitably restricted domain of physical nature (e.g., just 
an atom, or even just a set of substates of it, or all atoms and molecules). 

For the domains of quantum physics the mathematical structures are 
algebras of linear operators in linear spaces. The discovery of this, the 
fundamental properties of the algebra, and the other basic assumptions of 
quantum mechanics was a very difficult process, the history of which we do 
not want to describe here. It is much easier to start with the basic assumptions 
and to show that conclusions derived from them really describe some parts of 
nature. This is the path we want to follow here. 


11.2 The First Basic Assumption of Quantum Mechanics 
The first basic assumption (“axiom”) of Quantum Mechanics is: 


I. A physical observable, defined by the prescription for its measurement. is 
represented by a linear operator’ in a linear space. The mathematical image 
of a physical system is an operator *-algebra in a linear scalar-product space 
H# . The algebra is generated by some basic physical quantities, and the multi- 
plication is defined by algebraic relations. 


To illustrate the axiom I, we use one of the simplest examples, the one- 
dimensional harmonic oscillator. The one-dimensional harmonic oscillator 
can be defined as the physical system whose mathematical image is the 
algebra of operators that is generated by the following Hermitian operators: 


H, representing the observable energy. 
P, representing the observable momentum. 
Q, representing the observable position. 


The defining algebraic relations that are fulfilled by these operators are 


[P,Q] = PO - OP = 2 I, (2.1a) 
P? pw? 
i ree (2.1b) 


where h, , and w are numbers. J is the unit operator, and i = ./—1. The 
physical meaning of the numbers yu and @ will be found by correspondence 
to the classical system to be the mass and frequency of the oscillator, which 
are system constants. The physical meaning of the universal constant? h 
will be discussed in Section II.11. 


' In Chapter XIX we shall also have to admit semilinear operators. 
? 2nfi = his called Planck’s constant. 
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Figure 2.1 Model of a one-dimensional oscillator. 


The justification for the above statements is that there exist in nature 
physical systems on which one can perform measurements that lead to 
numbers that can be calculated from the above assumptions. Therefore 
it is reasonable to say that these physical systems, which are as always: 
“idealized physical systems,” have as their mathematical image the above 
mathematical structure. The name “harmonic oscillator” for this physical 
system arises from the correspondence to a classical system that has the same 
name. 

The classical-mechanical system called a one-dimensional harmonic 
oscillator has the following structure (Figure 2.1). Two mass points, with 
masses m, and m,, are acted upon by a force F along the line joining the two 
points. The force F is proportional to the difference between the distance r 
and its equilibrium length ro: 


F = —k(r — 19) = —kx, (2.2) 
where x = r — ro. The potential energy for this system is then 
V = +$kx?, (2.3) 
and the kinetic energy is 
| 2 
== 2.4 
Exin Qu Pp, ( ) 
where we have introduced the reduced mass 
mM,My 
= —_—_ 2S 
- m, + Mm, =) 
and the momentum 
dx 
= L—. 2.6 
Cai, (2.6) 


We can therefore define a classical harmonic oscillator as a system whose 
potential energy is proportional to the square of the distance from its 
equilibrium position, or as a system whose total energy 1s 


hoes p? + 4kx?. (2.7) 
2u 


As is well known, such a physical system is an idealization of the mechanical 
system in nature that consists of two masses connected by a spring (Figure 
2.2). Such a mechanical system performs oscillations with a frequency 

k 


o= |-. (2.8) 
mM 
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Figure 2.2 Classical one-dimensional oscillator. 


If we write Equation (2.7) in terms of w by solving Equation (2.8) for k, we 
have : 
1 pa? 
E = —p? +——x?’. 2.9 
mn? + (2.9) 
This expression looks exactly like one of the relations defining the quantum- 
mechanical harmonic oscillator. The important difference is that in the 
classical case the energy E, the momentum p, and the position x are real 
numbers, whereas in the quantum-mechanical case these quantities are 
represented by the operators H, P, and Q, respectively. 
For real numbers a and b one always has the relation ab — ba = 0; thus 


px — xp = 0. (2.10) 


Generally, for operators A and B, AB — BA # 0, 1.e., operators do not, in 
general, commute. For the particular case of the momentum and position 
operators one has the relation (2.1a). 

Fundamental relations like (2.1a) and (2.1b) cannot be derived; they can 
only be conjectured. This is true not only for quantum mechanics but also 
for classical physics. Thus the relation (2.9) is essentially the result of con- 
jectures that go back to Newton. 

The process of conjecture in physical theory can be described simply as 
follows: First one considers the experimental data in a certain domain of 
physics, e.g., the atomig domain. Then one tries to construct a mathematical 
structure from which one can calculate numbers agreeing with the experi- 
mental data. The algebra of operators defined by the relations (2.1) is such a 
mathematical structure for the quantum-mechanical harmonic oscillator. 
The conjecture (2.1b) was simple for us, since it was obtained by correspon- 
dence with the classical case. The physical meaning of the system constants 
je and w follows from the classical analogue. The conjecture (2.1a), however, 
was not so simple. The replacement of the classical relation (2.10) by the 
quantum-mechanical relation (2.la), and thus the realization that the 
quantum-mechanical observables are not represented by real numbers but 
by operators, was one of the greatest achievements of science. The relation 
(2.1a) is called the Heisenberg (or canonical) commutation relation. The 
physical meaning of this relation will become clear when we study its 
consequences in the following sections. 


Since we do not want to develop quantum mechanics along its 
historical path we shall give only a very brief and partial sketch of 
the motivation for (2.1a). It was known from experiment that light 
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was emitted from atoms at discrete frequencies. At the time the 
developments leading to 2.1a took place, the atom was pictured 
as consisting of a positively charged nucleus with a very small radius 
and negatively charged electrons revolving around it at a distance 
of about 10°° cm. Describing this picture in terms of classical 
physics lead to a dilemma. The revolving and therefore accelerating 
electrons would emit radiation, leading to a continuous loss of its 
energy and a continuous change in the frequency of the emitted 
radiation until the electron would finally, and very rapidly, fall 
into the nucleus. This contradicted the experimental situation: the 
atoms are quite stable, and the frequency of the emitted radiation 
can only take certain definite values (spectral lines), which are 
characteristic of the atom. Hence classical theory could not decribe 
the observations in the atomic domain, if the above picture of the 
atom was kept. One did not wish to abandon the picture of a 
miniature planetary system for the atoms, since replacing the 
Hamiltonian by another classical Hamiltonian that would take only 
discrete values would be virtually impossible. To obtain only dis- 
crete values for the Hamiltonian, some extra conditions were im- 
posed on the classical theory that were not only supplementary to 
but also in contradiction with the classical theory. This was the 
so-called “old quantum theory” (Bohr). 

Quantum mechanics originated in the idea of Werner Heisenberg 
and Max Born (1925) and Erwin Schrédinger (1926) to interpret the 
Hamiltonian as a mathematical quantity that would take only 
discrete values. From an analysis of the harmonic oscillator, 
Heisenberg and Born and Jordan concluded that in order for the 
energy to be a system of discrete values, the momentum and position 
should be given by the matrices® 


Os (m2 Oye a (2.12) 


3 This particular representation of the P and Q as matrices of operators with respect to a 
certain basis of vectors in an inner-product space will be derived in the next section (Section II.3). 
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where 2zh = h is Planck’s constant. Schrédinger built on a partial 
theory due to de Broglie, according to which free particles should 
diffract like light and should therefore be described by a wave 
function that satisfies a wave equation. From this he concluded that 
the momentum should be the differential operator 


ae (2.13) 


applied to the wave function. Both the Heisenberg and the 
Schrédinger Ansatz lead to the relation (2.1a); and soon after the two 
formulations were made known, it was shown that they were two 
different realizations of the same mathematical entities, namely linear 
operators in a linear space. We shall give a brief description of 
subsequent historical developments in Section II.6.4 


The quantum-mechanical harmonic oscillator is an idealization of micro- 
physical systems in nature; e.g., diatomic molecules are, under certain 
conditions, quantum-mechanical harmonic oscillators. The simplest possible 
geometrical (classical) picture of the diatomic molecule is two atoms, of 
masses m, and m,, bound together by an elastic force. 

There is an essential difference between the descriptions, in everyday 
language, of the physical systems whose idealizations are the classical 
harmonic oscillator and the quantum-mechanical harmonic oscillator. 
Everyday language gives quite an adequate description of classical systems, 
e.g., two carts with masses connected by a spring. Everyday language is, 
however, insufficient to describe quantum-mechanical systems. The quantum- 
mechanical harmonic oscillator is best described by the mathematical 
structure given by (2.la, b). This is the justification for the above definition 
and the conjectures that led to it. 

The procedure of conjecturing the mathematical relations outlined above 
for the harmonic oscillator works for many microphysical systems, so we 
summarize it again: 


1. Obtain a classical picture of the microphysical system, and express 
the observables as functions of position and momentum coordinates. 

2. Replace the real numbers corresponding to momentum p and position 
x in the expressions for other classical observables by the Hermitian 
operators P and Q, respectively. The operators P and Q satisfy the 
Heisenberg commutation relation, and from them we obtain the 
algebra of observables for a quantum-mechanical system. 


Not all classical observables can be expressed in terms of momentum and 
position: the intrinsic angular momentum or spin of a top, for example, 
cannot be expressed as a function of the momentum and position of the top. 
For these observables the above procedure of correspondence between the 


“ A few historical remarks will appear throughout the book. For a description of the historical 
developments see, e.g., Jammer (1966). 
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classical and the quantum-mechanical observables has to be generalized. 
We can give a vague formulation of this conjecturing procedure: in the 
transition from the mathematical description of the classical system to the 
mathematical description of the corresponding quantum system, we replace 
the numbers representing classical observables by operators that represent 
the corresponding quantum observables. The mathematical property of 
these operators is specified by the mathematical (in particular algebraic) 
relations between them. These relations between the observables are obtained 
from the relations between the classical observables and some additional 
relations that specify the property of noncommutativity. These relations can 
often be obtained from analogy to known relations between similar 
quantities. 

There exist, (in elementary-particle physics for example) many quantum- 
mechanical observables without known classical counterparts. The defining 
relations between these observables can then be obtained only phenomeno- 
logically from a comparison between the consequences of these relations 
and the experimental data. 

We shall discuss many examples of this conjecturing procedure. Rarely 
is the choice of the relations unambiguous, and the ultimate justification of 
the defining relations, like that of any fundamental theoretical assumption, 
is their success in providing agreement between theoretical consequences 
and experimental data. 

The representation of physical observables by operators poses the problem 
of relating these mathematical quantities to experimental data, which are 
numbers. The solution of this problem is given by a number of further basic 
assumptions or axioms, which provide the actual interpretation of quantum 
mechanics. We shall introduce these axioms at suitable places when they 
are needed. 


11.3 Algebra of the Harmonic Oscillator 


We now give a mathematical description of the quantum-mechanical 
harmonic oscillator. To find the mathematical properties of the algebra of the 
harmonic oscillator means to find how the operators of this algebra act on 
the vectors of the space ”. In order to do this we introduce a new operator a 
and the operator a‘ (which is its adjoint): 


1 po i 
=-—~—/{ /—Q+-——P], 
: vl 2 2 / pooh 


1 Mo i 
at = Q =a ——— P), 
o2 (/ h ) poh 
Then we introduce the new operator 


N = a‘a, (3.2) 


(3.1) 
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which is Hermitian. Inserting ee into (3.2) we get 


N= = 2 + sh Pp? — + (PO — QP). (3.3) 
Using the relations ~ we obtain 
N= HH — 41. (3.4) 
Similarly we calculate 
aa’ £ = H +41. (3.5) 


Therefore as a consequence of the canonical commutation relation (2.1a) 
we obtain for a and at the commutation relation 


[a, at] = aat — ala =I. (3.6) 
Furthermore, we can express the energy operator H in terms of a and a! to 
obtain 


h 
H =hoata + a = ho(N + 41). (3.7) 


Now we want to find the eigenvalues and eigenvectors of H and N. As 
the eigenvectors of N are eigenvectors of H, it is sufficient to find all the 
eigenvectors of N. 

Let us assume that there is at least one eigenvector of N in #, and call its 
eigenvalue J. (This is not a trivial assumption, since there are many operators 
that do not have eigenvectors in #. In fact it is an additional assumption 
without which equation 2.la is ambiguous.) Let ~, be this eigenvector; thus 


NQ@, = 49;. (3.8) 
Then calculate Nag, using Equation (3.6), 
Nag, = (ata)ag, = (aat — Day = alata — Noe, 
= a(N — 1g, = aA — 1)9, = (A — 1)ag,. 
Thus 
N(ag,) = (A — 1)(aQ,), (3.9) 


so that either ay, = 0 or aq, is an eigenvector of N with eigenvalue 4 — 1. 
Suppose the latter is the case. Then we can define 


Pr-1 = Ay. (3.10) 
Repeating the above calculation on @,_,, we find that 
Q,-2 = aaQ, Cl) 


is either zero or an eigenvector with eigenvalue A — 2. We continue in this 
way and obtain a sequence of vectors 


Q,-m=a"p, (m=0,1,2,...), (3.12) 
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which are eigenvectors of N with eigenvalue (A — m), as long as ,_, # 0. 
In a similar way one calculates N(a‘¢,) to get 


N(ato,) = (A + I(atg,). (3.13) 
Hence ¢,,, = a'@, is either the zero vector or an eigenvector of N with 


eigenvalue A + 1. It is easily shown that always ~,, # 0. Suppose atv, = 0. 
That means 


lato, || = 0. (3.14) 
But 
lla‘, |? = (atg,, ate), (3.15) 
and by definition, 
(at, a'e,) = (@;, aa'g,). (3.16) 


By the commutation relation for a and a‘ we have 


(p,, aa’ p,) = (~;, atag,) + (4, 19) 
= |lag,||? + lg,ll? 4 0, 


since ~, # 0. Thus @,,, is always an eigenvector of N with eigenvalue 
A +1. Again by repeating the above process one obtains a sequence of 
vectors ~,,, (m = 0, 1, 2,...), which are eigenvectors of N with eigenvalue 
A +n. These are always nonzero, as follows from the above proof. 

We now determine when ¢,_,, can be zero. We calculate 


(3.17) 


(Gm, NQi-m) = (A — m)(@1-m> Pa—m) = (A — m)||Q,—mll? 
and 
(Pa—m> N@a-m) = (Pa—m> 2° ,—m) = Nl4Pa- ml”. 
Since the norm is always nonnegative, it follows that 
) ee Eee (3.18) 
Pr—mll 


Therefore the sequence of eigenvectors ”,_,, must terminate after a finite 
number of steps, and there must exist one vector Qo such that 


aPo = 0. 


The vector @, is an eigenvector of N with eigenvalue zero, since N@o = 
ata, = a‘0 = 0. Now we define the normalized vectors 


_ 
Po = Tool’ 
db, me C,a'do, (3.19) 


bn = C,(a')"bo 
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From the considerations above we know that the vectors @, are eigenvectors 
of N, with eigenvalue n, and the C, are chosen such that the @,, are normalized. 
Thus we have 


N¢,=1,, = Pall = 1. (3.20) 
The C,, are calculated as follows: From Equations (3.20) and (349); 
1 = Pall? = (abo, a") (Ci. 
but 


(abo, a‘ fo) = TomG (at De 1,4 ‘o,- 1)s 


SO 


IC.P? 


jy at La 
(Crea 


1C,_1/7 (,- 1, 4a Tone 1): 


Using Equation (3.6), 
IC, 


l= (Gu " Ge AGE aa 1)p,- 1) 
[C,|? 
= 16,2 Pn (N= 10,4) 
Cie 
= =n iP (d,- 1> Pn- 1) 
E. ie 
peel ; 
hence C,, must be chosen so that 
n(C,? =1C,-1]?. (3.21) 


Since ¢, is normalized, C, = 1, and one solution of Equation (3.21) is then 


ile 
Cae (3.22) 


There are other solutions of (3.21) which differ from (3.22) by a factor of 
modulus 1; we choose (3.22). 

We may summarize the procedure as follows: Start with a normalized 
vector dp € # that has the following property: 


agy = 0. (3.23) 


Then apply to ¢, the operator a‘ to obtain a system of eigenvectors of N, 


1 
Pn aa Jn! ao, (3.24) 
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with eigenvalues n = 0, 1, 2,.... These eigenvectors are normalized by the 
construction above, and they are orthogonal to each other since they are 
eigenvectors of a Hermitian operator with different eigenvalue (cf. Problem). 
Thus we have the orthonormality relations 


(dn; Pw’) a Oa: (3.25) 


and the ¢, form an orthonormal system in #. The operators a and a’ are 
defined on this orthonormal system by 


ag, = [noe 
a‘, Sy n+ 1Oy+1- 


Because of this property, a is called the annihilation operator and a’ creation 
operator. Both are often refered to as Ladder operators. All the elements of 
the algebra of the harmonic oscillator that are functions of a and at are 
defined on the elements of this orthonormal system. Consider the set of all 
vectors 


(3.26) 


vy — ee, (3.27) 


where the a, are complex numbers and the sum runs through an arbitrarily 
large but finite number n. This set of vectors forms a linear space called the 
space spanned by ¢,,. We will denote this space by # or ® when we ignore 
details. 


[More precisely, the space of all the Wy =). a,¢, for which 
Fo lanl? < 2 will be denoted by # (for Hilbert space), and the 
space of all w for which )°*. 9 |a,|7(n + 1)? < 00 for all p = 0, 1, 
2, 3,... will be denoted by ®. Obviously 


Oc #. (3.28) 


The advantage of ® over # is that all operators representing ob- 
servables can be defined on the whole space ®, but not on all of #. 
Though both spaces ® and # are mathematical generalizations, 
® is more convenient for physics. (® is called Schwartz space.| 


As all observables are functions of a and at, and a and a! are known by (3.26) 
and (3.27) for all y, all observables for the quantum-mechanical harmonic 
oscillator are in principle known on ®. So the task of determining the mathe- 
matical properties of the algebra of the harmonic oscillator is in principle 
completed. 

Let us calculate the diagonal matrix element of the energy operator H 
between the vector @, for a fixed value of n. Using Equations (3.7) and (3.20), 


we have 


Hd, = hoN + 3), = hoon + or (3.29) 
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and 


(Pn, Hb,) = holy, Noy) + hon, 2On) 
= hon + 3)/b,|? = hoon + 3). 


We call this number E,: 
E,, = han + 4). (3.30) 


II.4 The Relation between Experimental Data and Quantum- 
Mechanical Observables 


We now wish to formulate the basic axiom of quantum mechanics that gives 
the connection between the operator representing an observable and the 
experimentally measured values for this observable. Before doing this, we 
observe some fundamental differences in the experimental properties of the 
classical harmonic oscillator and he quantum-mechanical oscillator. 

Consider the classical picture in Fig. 2.1. Assume that the system has no 
energy—i.e., does not perform any vibrations, or vibrates with zero amplitude. 
The system can be excited into vibration if one shoots a structureless, massive 
projectile into one of the masses. The oscillator can perform vibrations of 
any amplitude, depending on the amount of energy transferred to it, which in 
turn depends on the momentum of the projectile. Thus for the classical 
oscillator one can prepare a projectile that excites the oscillator to vibrate 
with any desired energy E. The energy is given by 


= tka’, (4.1) 
where a is the amplitude of the vibrations 
x(t) = asin(wt + 6,). 


In the realization of the classical harmonic oscillator as two massive carts 
(see Figure 2.2), the system may be excited, for example, by hitting one of the 
carts with a third cart of suitable momentum. 

The situation is quite different for the quantum-mechanical harmonic 
oscillator. To see this we will consider an analogous experiment with 
microphysical systems. As is typical for experiments with microphysical 
systems, one does not have a single physical system but an ensemble of 
systems, i.e., a collection of systems that are in a certain sense identical. 
The ensemble of microphysical harmonic oscillators that is used in this 
experiment is a gas of CO molecules, and an electron beam serves as an 
ensemble of structureless particles. 


Remark: It should be remembered that the statement that a physical 
system is structureless has no universal meaning. Any system can be 
considered structureless under certain conditions but not under 
others. For instance, in the domain of physics for which the kinetic 
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Figure 4.1 Schematic diagram of an energy-loss experiment. 


theory of gases is valid, the molecules can be considered as structure- 
less systems, whereas when higher energies are considered, they,are 
seen to have structure. The electron is a system that can be ¢on- 
sidered to be without “energy structure” for all currently available 
energies. 


Experiments of this type are called energy-loss experiments, and are based on 
the original experiment performed by Franck and Hertz in 1914. 

Figure 4.1 shows a schematic diagram of such experiments. A beam of 
electrons leaves a monochromator” with a very narrow energy band of 
energy E,, and enters a collision chamber filled with an ensemble of CO 
molecules (or any other physical system whose structure is to be investigated.) 
After the electrons have passed through the CO gas, they go through an 
analyzer, which focuses those electrons of a similar narrow energy band E,. 
onto the detector. (The energy resolution is typically 0.005 to 0.05 eV; in 
particular experiment described in Figure 4.2a, it is 0.06 eV.) The energy E,. 
that is selected by the analyzer can be varied, so that one can measure the 
intensity I (the electron current at the detector) of the electrons as a function 
of the energy loss E = E, — E,. 

The result of an actual experiment, performed with the apparatus schema- 
tically depicted in Figure 4.2a on the diatomic molecule CO, is shown in 
Figure 4.2b.° What we see is that for an analyzer setting hig EL. — En=9, 
a major fraction of the electron current e does not lose any energy (elastically 
scattered electrons), as is indicated by the high value on the graph at v = 0. 
Then there is a relative maximum of intensity for the electrons that have lost 
the energy E, (~0.26 eV), which means that a large portion of the electrons 
of energy E, lose energy inelastically to the CO molecules. A second bump in 
electron intensity occurs at the energy loss E, = 2E,, and so on. Asa whole, 
Figure 4.2 shows us that the scattered electron current e’ is a mixture of 
electrons that have lost one of the eight discrete amounts of energy 
He. BaE,,.2., 6, at 0=90, o—1,...,0= 7, respectively, to the CO 
molecules. 

We conclude from this experimental situation: The vibrating molecule 
cannot be excited to any arbitrary energy value. Only a discrete number of 


5 A monochromator is an apparatus that prepares a beam with very well-defined momentum 
and, therefore, with very well-defined kinetic energy. An analyzer is the same kind of apparatus 
used for a different purpose. 

® From G. J. Schulz, Phys. Rev. 135, A988 (1964), with permission. 
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Figure 4.2 (a) Schematic diagram of double electrostatic analyzer. 
Electrons are emitted from the thoria-coated iridium filament. They pass 
between the cylindrical grids at an energy about 2.05 eV, and are 
accelerated into the collision chamber, where they are crossed with a 
molecular beam. Those electrons scattered into the acceptance angle of 
the second electrostatic analyzer pass between the cylindrical grids, again 
at an energy from 0 to ~2 eV. The electrons pass the exit slit into the 
second chamber and impinge onto an electron multiplier. (b) Energy 
spectrum of scattered electrons in CO at an incident electron energy of 
ZEN. 
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Figure 4.3 Energy-level diagram of the harmonic oscillator. 


energy values are possible; 1.e., the physical system has a discrete number of 
energy levels. This situation is represented by an energy-level diagram (Figure 
4.3). In the particular case of the vibrating molecule, the energy-level diagram 
consists of a series of equidistant levels, as can be seen from the experimental 
results in Figure 4.2b. 


From Figure 4.2b, one sees that this is only approximately true 
and that the distance between energy levels actually decreases with 
increasing energy. This discrepancy indicates the limitation of the 
harmonic oscillator as the model for the vibrating molecule. We will 
discuss the corrections to this model later. 


Comparing this experimental energy spectrum with the result in Equation 
(3.29), we see that they agree if we take AE = hw for the energy difference 
between two neighboring energy levels. Thus the diagonal matrix elements 
(¢,, H@,) give the possible energy values of the quantum-mechanical 
oscillator (e.g., vibrating molecule). We interpret the situation as follows: 
The projectile (electron) excites the harmonic oscillator (vibrating molecule) 
into any one of a discrete number of states described by @,,. If no excitation 
takes place, the harmonic oscillator is in the ground state, described by ¢o. 
If all the electrons that passed through the gas had lost the same amount of 
energy, i.e., if there were only one bump in the experimental curve at v = no, 
the ensemble of molecules in the collision chamber would be described by one 
of the ¢,,, say @,,- In this case the quantum-mechanical system would be said 
to be ina pure state ¢,,. This is often expressed by saying that “all molecules 
have the energy E,,,.” The results in Figure 4.2b show that the ensemble of 
molecules considered (i.e., those molecules that have collided with the 
electrons that are scattered into the acceptance angle of the analyzer, which 
in the experiment of Figure 4.2a is about 72° from the electron beam axis) 
is not in a pure state, since the electrons have not lost one particular value 
E,,» but rather a variety of energies Ey, E;, E2,..-. E,. Thus an ensemble of 
molecules is said to be in a mixture of states, or the state of the physical 
system is said to be a mixture. This mixture can be described by the set of 
vectors $9, 1, $25---,Pn»--- and a set of numbers (relative probabilities) 
Wo. W1>-++> Was++. Where w, is chosen to be proportional to the height h, 
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of the bump corresponding to the energy E,,, and they are normalized so that 
yw, = 1. (4.2) 


Thus one may form a mental picture of the CO gas as a collection of N 
molecules, where N, = w,N is the number of molecules with energy E,, 
in the ensemble. If it were possible to pick one molecule out of the ensemble, 
w, would be the probability that this molecule has energy E,,. However, the 
picking of a single specimen from.a microphysical system is not possible, and 
should not be done even in a Gedanken experiment. The mere physical 
meaning of w, is that it represents the relative intensity of the electron current 
that has lost energy E,,. 

Before we introduce a concise description of the state of a mixture, we 
shall give an alternate description of a pure state. To do this we shall need 
some more mathematics, which we now introduce. 


[An operator W is called positive definite if(¢, Wd) > Oforallde #. 
A Hermitian operator P is called a projection operator or projector iff 
ja 1th 

It is easily seen that the set Z = {fp € H#: Ph = } is a subspace 
AoW; forif¢,weZanda, Be C, then Piad + By) = «Pd + BPW 
= ad + By, ie., 4 is closed under multiplication by a number and 
under vector addition. Alternatively, 4 = {P¢:¢ € #}, and hence 
we write % = P#’. Asanexample, let W = R?, the three-dimensional 
Euclidean space, and let 4 be a plane passing through the origin. 
Then for any x € R? let Px be the ordinary projection of x onto the 
plane (see Figure 4.4). 

Two subspaces %,, 4, < # are said to be mutually orthogonal if 
for each @ € 4, and for each p € 4, we have (¢, ¥) = 0. The set 
A= {b+w: oe & and we &,} is called the direct sum of 4, and 4, 
and is denoted 4 = £, ® &3. Similarly, given a collection 4,, 4), 
#,,..., 0f mutually orthogonal subspaces, one can form their direct 
sum, %= ); © 4 = {)i Gi: G € A}. 

Given a projector P that projects onto the subspace 4, the operator 
I — P is also a projector. [Proof: Since P? = P, it follows that 


Figure 4.4 Projection operator onto a plane. 
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(— PP =1-IP-—PI+P?=I1-P-(P-—P)=!1-P] 
I — P thus determines another subspace, which we will denote by 4+. 
All vectors in 4* are orthogonal to those in 4; for if @ € Zand we 4!, 
then @ = P¢ and w = (J — P)w; using the definition (P* = P and 
P? = P) of a projector, we have (¢, W) = (Pd, (I — P)W) = (¢, 
PUI — P)w) = (¢, (P — P*)W) = (¢, OW) = 0. Since for any ye # 
we may write wy = Py + (I — P)y, any projector P gives a decom- 
position of # into orthogonal subspaces, #7 = 4 @ 4". 

For any ¢ € #, the set {ap: « € C} is a one-dimensional subspace 
of #, called the space spanned by @. 

Finally, given a basis {@,} of #, we define the trace of the operator 
A by 


Tr A = Y (¢,, Ad,). (4.3) 


It can be shown (Problem II.21) that Tr A is independent of the basis 
used for an arbitrary operator A. Tr A need not be finite. We will in 
general assume that this does not occur for those operators whose 
trace we take. The trace has the following properties: 


Tr(AB) = Tr(BA). (4.4a) 
Tmseneral, Tr(A,A,...A,) = Ur(A,A; ...A,A,)- 
Tr((A + B)C) = Tr(AC) + Tr(BC).] (4.4b) 


We now use these new mathematical concepts to describe the harmonic 
oscillator. Let 
4, be the space spanned by ¢o, 


4, be the space spanned by ¢,, 
4%, be the space spanned by ¢2, 


#,, be the space spanned by ¢,,. 


Then the space # is the orthogonal sum 


y O4,=%. (4.5) 


n=0 
Let A, (n = 0, 1, 2,...) denote the projection operators that project on /,. 
Then the operator form of (4.5) is written 


ee (46) 
n=0 


It says that every f € # can be written as the sum of its components A, fin 
the subspaces &,,: 


fo Af (4.7) 
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This is equivalent to writing 
f= ¥ bud) = ¥ bol (48) 


which is, according to (I.2.15b) the statement that @, is a basis. Thus A, 
may also be written A, = |¢,><,|, and (4.6) may be written 


1 = ¥ 1by><bnl- (49) 


We will often denote the projection operator A on the one-dimensional 
subspace spanned by a unit vector y by 


A= |W. (4.10) 


If we apply the operator H to the arbitrary vector f and use (4.9), we have, 
because of (3.29) and (3.30), 


als 2, Hla <bal f> =D Enlbn><bal f >. 
ue n=0 
As f was arbitrary, we may as well omit it and write 
H = ¥ E,lon><bal = ) EnAn- (4.11) 
n=0 n=0 


This is called the spectral representation of the operator H. With the notion 
of the trace we can express the energy values E, = (¢,, H@,) in the form 


Ee = THA.) (4.12) 
PROOF. Since 
= Oy, Wire =o 
No, = 
n® . Wile S22. 
one obtains 


Tr(HA,) =) (¢), HA, oy) 
u (4.13) 
= Se (¢,, HO, )0 yn aa EW 


Thus the state of the harmonic oscillator in which the energy is E, can be 
described by the projection operator A, in place of ¢,. Clearly A, and @, 
do not represent the same mathematical objects. A, represents the subspace 
h, =N\,H = {A, f| f € #}, which is the one-dimensional subspace spanned 
by ¢, (a one-dimensional subspace is often called a ray). The space 4, 
contains many normalized vectors, namely, all those that can be obtained 
from ¢, by multiplication with a phase factor e'*, where 0 < « < 27. (The 
set of all e'%@ with |||] = 1 is called a unit ray.) 

It is generally assumed that a pure state of a physical system is described 
by a projection operator on a one-dimensional subspace (or by a unit ray) 
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and not by a vector. However, it is very common to call not only A but also 
any vector pe A¥ a“state” or “state vector.” For the restricted case of pure 
states, we formulate the generalization of our above conclusions as the 
restricted version of the second basic axiom of quantum mechanics. 


Il’. A pure state of a physical, quantum-mechanical system is characterized 
by a projection operator A on a one-dimensional subspace (or by the one- 
dimensional subspace A# itself). The possible values of an observable A 
that are measured by an experiment performed on the physical system in the 
state A are given by 


<A> = Tr(AA). (4.14) 


<A> is called the expectation value of the observable A in the pure state A. 


The statement contained in II’ is incomplete as long as we do not define 
precisely what “expectation value” means, i.e., until we say exactly to which 
experimentally measured quantity <A> should correspond. Tr(AA) is 
something calculated from the mathematical image of the physical system; 
thus, given the mathematical image (i.e., the algebra), the observable A, and 
the state A, Tr (AA) is a precisely defined mathematical quantity. Equation 
(4.14) gives the relation between the mathematically obtained quantity 
Tr(AA) and the quantity <A>, which is measured in the experimental setup 
that defines the observable represented by the operator A. <A> is obtained 
by the following prescription: One performs a series of N measurements of 
the observable A; i.e., one either has N identical systems on which one 
performs the measurement, or one performs the same measurement N times 
ona physical system, making sure that the system is in the same state for each 
measurement. Let the numbers measured for the observable A in this experi- 
ment be denoted ap, a;,...,a,,---, and let N, be the number of measure- 
ments that gave the result a,. Then <A) is the average of these measurements, 


Ans 


Nn 
<A> = 2 a 
for sufficiently large N. The ratio w, = N,/N is the probability of obtaining 
the value a, in one measurement. 

If the system is in a pure state, say A, (the projection operator on the sub- 
space spanned by ¢,) then the intensity will have one bump at the energy 
E, — E, = E,.\f N measurements are performed on this system, the result 1s 
allways E,. Consequently 


N 


N 
=)\S—E =—E,. 15 
<i> day Ee wh Et (4.15) 
Hence, in the pure state A,, the probability w, for the value E, is w,; = | and 


all other w; = 0. 
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A pure state A is called an eigenstate of the observable A if A# is the one- 
dimensional space of all eigenvectors of A with the same eigenvalue.’ Thus 
the system in the above described experiment is in an eigenstate of the observ- 
able H if one bump appears at energy E,,. In general, as we shall see later, a 
physical system can be in a pure state described by a one-dimensional space, 
and the measurement of some observable A, of which this state is not an 
eigenstate, will still yield a set of different numbers a,, a2,...,4,,-..- This 
is the point at which quantum mechanics deviates drastically from classical 
theories. We shall discuss this matter in detail later. 

We now discuss the description of a mixture rather than a pure state. 
consider the gas of CO molecules in the collision chamber. If we measure 
the energy of the molecules N times, we expect 


No = WON measurements will give the value Ep, 
N, = w,N measurements will give the value Ej, 


N,, = w,N measurements will give the value E,,, 


because w; is the ratio of molecules in the ensemble, with energy E;. Thus the 
average value <H> of the energy operator H is 


CH) =) WaEn =D, Walbns Hoy). (4.16) 


which is to be equal to the expectation value of H in the mixed state. We 
want to write this expectation value as a trace of some operator; to do this 
we introduce the positive-definite Hermitian operator 


W = di waAns (4.17) 


where each A,, is the projector on the eigenspace 4%, of #% spanned by 4@,. 
Now we calculate 


Ti(HW) = (¢,, HW¢,) 


=) > w,(¢,, HA, ,). 
Since a 
Any = Ony Pn 
= LL walbs HOO 
we have 
Tr(HW) = Y Wil bns Hoy): (4.18) 


7 As will be discussed shortly, projection operators on multidimensional (including infinite- 
dimensional) subspaces may also represent states of a physical system. which are then, however, 
not called pure states. Also, in that case we shall call the state A an “eigenstate of the observable 
A” if A# is the space of all eigenvectors of A with the same eigenvalue. 
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Thus if the state of the mixture is described by the operator W defined in 
Equation (4.17), then the expectation value of the energy is given by 


<H> = Tr(HW). (4.19) 


It is important to note that in Equation (4.19), W is not a projection operator 
as it is for a pure state. 

The statement contained in Equation (4.19) is conjectured to be the general 
expression for quantum mechanics, and we formulate it as the second basic 
assumption. 


II. Thestate of a quantum-mechanical system is characterized by a positive- 
definite Hermitian operator W. The values measured in an experiment, i.e., 
the expectation value <A> of an observable A, are given by 


A> —TrAwW) i IrwW —1 
or by 


Tr(AW) 
Tr W 


<A> = 


The operator W is called the statistical operator, and its matrix is called 
the density matrix. The state of the system is called a pure state iff W is the 
projection operator on a one-dimensional subspace. 

It is customary to normalize the probabilities w,, so that 


io@) 
yw, = 1, - (4.20) 
n=0 
i.e., to normalize the statistical operator so that 


Tr W=1. (4.21) 


Note, however, that Equation (4.20) as well as Equation (4.21) is only a 
convention. Instead of w, we could have used the height of the bumps h,, to 
characterize the state. In that case we would get, in place of Equation (4.17), 


W=Th,A, (4.22) 
and 
TrW=>h, 
It is easy to see that 
W 
W = 
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W and W describe the same state of the harmonic oscillator. However, if one 
uses W to characterize the state, then the expectation value of the observable 
H is given by 

Tr(HW) Tr(AW) 


i = 4. 
TW or in general <A) Tr (4.23) 


(H> = Ww 


Thus, whenever possible, it is more convenient to use the normalized statisti- 
cal operator for the description of the state. 

The second basic assumption of quantum mechanics cannot be derived; it 
can only be conjectured. It has been conjectured, in the long process of 
discovering quantum mechanics, from a variety of experimental results. We 
have in our above considerations made this axiom plausible for the particular 
case of the energy measurement of the harmonic oscillator. 

Consider the state of the harmonic oscillator given by 


W = y w, An: 
The projection operator A, on the eigenspace of H with eigenvalue E,, is 


also an observable. Thus we can calculate the expectation value of A, in 
the state W: 


<A, = Tr(A,W) = ¥ THA, A,) Wy (4.24) 


From the properties of projection operators we know that 


A,A, =9 forp#n 


Ay Ay = Ans Coa 
consequently 
~ <Ay> = one ECA, )W, (4.26) 
= Tr(A,)wp, (4.27) 
SO 
<Ap> = Wp, (4.28) 


because Tr A, = 1, since A, is the projector on a one-dimensional subspace. 
Therefore A, is the observable whose expectation value is the probability 
of obtaining the value E, in a measurement of the energy, 1.e., the probability 
of finding the system in the state A,. In particular, for the pure state W = A, 
we have 


0 for p #n, 


1 forp=n. ae 


ca = Tr(A,A,) = 
Generally, projection operators represent a particular kind of observable 
called a property or proposition, whose expectation value tells us something 
about the state. 
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The basic assumption II states how the expectation value is calculated if 
the statistical operator W is known. We now discuss how the statistical 
operator has to be chosen in any particular case. To do this we use our belief, 
which is supported by all experimental evidence, that a repetition of a 
measurement always leads to the same result, or, in other words, that all 
experiments performed under the same conditions give the same experimental 
numbers. It is assumed that generally a certain physical system may be in a 


number of states 1, 2,..., which in quantum mechanics are described by 
Statistical operators W,, W,,..., and that these states can be determined 
by the measurements of observables A, B,.... Thus every state of the 


physical system is the result of a preparation of the system. A preparation is 
a series of manipulations with physical equipment that lead to the measure- 
ment of an observable (or several observables). These measurements may 
or may not affect the state of the system. In classical physics the effect of a 
measurement upon the state of the physical system is always neglected; the 
essential difference in quantum physics is that here this effect is taken into 
account. Thus in principle, and also in practice, not all measurements can 
be performed simultaneously. Therefore one must always be able to re- 
produce a certain physical state, so that it can be determined by a series of 
successive measurements. After the measurement of each observable, the 
system will be in a state that has been prepared by this measurement. This 
state must be described by a statistical operator for which the calculated 
expectation value of the observable agree with the values just measured. 
That is, the W must be chosen so that the calculated values agree with the 
measured values that one will obtain in an “immediate” repetition of the 
measurement, i.e., in a measurement that is performed before the state has a 
chance to change (develop in time). In fact, this principle was used when we 
wrote down the state of the CO molecules. In the example of the ensemble of 
CO molecules, the preparation consists of setting up the apparatus and 
bombarding the gas with monochromatic electrons. This prepares the CO 
gas in a state that is a mixture of energy eigenstates A,,, with the mixing ratios 
w,. The statistical operator W has to express what has been measured on the 
physical system, or what preparations have been performed. Therefore we 
choose the statistical operator to be 


W = Yw,A,- (4.30) 


With this W, one calculates for the outcome of the measurement of the energy 
H the numbers 
[Bao ope DO ee 


Thus the original values obtained in the preparation of the state are predicted 
for an immediate repetition of the measurement (i.e., a measurement before 
the state W has changed), which in the present case may be any finite amount 
of time later, since the situation in this experiment is stationary, but before 
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another measurement is made or before the setup is altered (e.g., by switching 
off the electron beam). 

For the general case we formulate the above described procedure for the 
determination of the statistical operator as a basic assumption: 


Illa. Given that an observable A has been measured with the following 
results 


1,42, 43,..-, 
Way Waz> Wa32°+*s 


where w, gives the relative frequency with which the value a; was measured 
(i.e., after N measurements the value a; occurred w,, N times), then the state 
of the system immediately following the measurement is represented by the 
statistical operator W given by 


1 


WwW = d Wa, Na, dim(A,, #)’ (4.31) 


where A,, are the projection operators on the eigenspaces of A with eigen- 
values a;, and where dim(A, #) = Tr A,, denotes the dimension of the 
space A,. 


Before we discuss the general case, let us consider the particular situation 
that a, has been measured with the probability w, = 1 and all other w,{i # n) 
= 0. Then the statistical operator is chosen as 

W=A : (4.32) 
7 *dim(Ay ; 
[Note. The A, are not necessarily projection operators on one-dimensional 
spaces. If A, is the prejector on a one-dimensional space, the state (4.32) 
is said to be a pure state. ] As we shall prove below, one obtains for the expecta- 
tion value 


<A> = Tr(AW) = Tr(AA,,) = a,, (4.33) 


i 
dim(A,, #) 
and for the probability 


Ww. = <A, = ina Tr A, sade (4.34) 


I 
dim(A,, #) 
In other words, this statistical operator (4.32) predicts, for an immediate 
repetition of the measurement of A, the value a, with certainty. Before we 
prove (4.33) and (4.34), it is convenient to present some more mathematics. 


[If the space A,.# of eigenvectors of A with eigenvalue a has the 
dimension dim(A,#), then there exist dim(A,#) linearly in- 
dependent eigenvectors. One may then choose an arbitrary basis of 


Experimental Data and Quantum-Mechanical Observables 33 


dim(A,#) mutually orthonormal eigenvectors of A with eigen- 
value a, i.e., a set of vectors 


ieee. Aen) 
that fulfill 
mae (4.35) 
and 
(ha, ba) = Oxx- (4.36) 


The basis for the whole space is given by 


1 2 dim(A(a1) 
Gea. s- ° ae ee? 


ena soa Ong ae (4.37) 


In general, dim(A,,,#) # dim A,,,#. This statement (spectral 


theorem), given here without proof, is only true for operators with a 
“discrete spectrum,” to which we want to restrict ourselves for the 
moment. Any of the dim(A,,#) may or may not be infinite. An 
arbitrary vector f € 4 is then written, according to (1.2.15b), 


f= 1O><G5,1f>- (4.38) 


K, aj 


If we apply A to the arbitrary vector f written in this form we obtain 


Af = > ae: < os lf: (4.39) 


or, if we omit the arbitrary f, 


A= Pi ail $5.>< ba) = La: 160 <a. (4.40) 


Kaj a 
The operator 


15> <Pa,1 = Aa, (4.41) 


is the projection operator on the subspace A,,# of eigenvectors of 
A with eigenvalue a;, since 


Aa Ma, = D105 <5 1G5> O51 = L195. <b5)15 a; = No, Fa, 


A(Ag, f) = 2 ail ba,><Pa,1f> = ai(Aa, S). 
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Therewith (4.40) can be written 
A aoe a A (4.42) 
This is called the spectral representation of the operator A (with 
discrete spectrum). The set if A,, is called the spectral family. 
If one omits the arbitrary f in (4.38), one obtains the resolution of 
the identity operator J with respect to the spectral family of the 
operator A: 


T= Y1b5><ba1 =D Aa: 


We now prove (4.33). We use the basis (4.37) and calculate 
Tr(AA,,) = Ds ($5,, AAa,05,) = >, (5,2 AG5,) 


= Yi a,($%,» 0%,) = ay dim(A,, 3). (4.43) 


Thus 


1 


Tr(A W) => Tr(AA,,) dim(A, #) 


——aiee 
Equation (4.34) is established in the same way. 

We now proceed with the introduction of some further mathe- 
matical notions: two Hermitian operators A, B are called commuting 
operators iff 


- AB=BA_ or[A, B] = 0. (4.44) 
Let A,, be the spectral family of A, and P, the spectral family of B. 
Then the statement that A and B commute is equivalent® to 

Na, Py, = pe gee tor very G;,D,. (4.45) 
Thus, using (4.42), an operator A commutes with B if 


AP,, = P,,A_ for every b;. ] (4.46) 


A pure state (e.g., the energy eigenstate of the one-dimensional harmonic 
oscillator discussed above) is a special case of (4.32). 

We turn now to the general expression (4.31), and calculate the expectation 
value 


Tr(AW) = Tr(AA,,). (4.47) 


de Gim(A, #) athe H) 


® In fact (4.45) is the precise statement of commutativity; in order to make (4.44) precise one 
has to add the requirement that 4* + B? be essentially self-adjoint (its Hilbert space closure is 
selfadjoint). 
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Inserting (4.43) into this, we obtain 


<A> = Tr(AW) = ¥ w,,4;, (4.48) 


as it ought to be for the expectation value, i.e., the average value in a series 
of measurements of the observable A in the state of the system described by 
W. 

The projection operator A, on the eigenspace of A with eigenvalue a 
represents the observable “probability of a,” as it should. To see this we 
calculate 


1 
dim(A,, #) 


1 
dim(A,,#) 


Tr(A, W) = ¥ wy, Tr(AgA,,) 


a 


= » Wa; One Tray 


ai 
Here we have used 


A,Aqg, =9 for a; # a, 
VA a ea a 


Thus the statistical operator W of (4.31) leads to the prediction we wanted, 
namely that an immediate repetition of the measurement of an observable 
gives the same value as the original measurement that constituted the pre- 
paration of the state. 

If no measurement has been performed that constitutes the preparation of 
a state, and no information concerning the state has been obtained, then all 
values a; will be equally probable in the outcome of an experiment: w,, = 
const. The statistical operator for such a state must, therefore, be taken to be 
proportional to the unit operator 


W=I. (4.49) 
If the system is known to be in thermodynamic equilibrium, then the 
statistical operator W is chosen to be the Gibbs distribution 


W =e Blk (4.50) 


where H is the energy operator, and k and T are the Boltzmann constant 
and absolute temperature respectively. A justification of (4.50) can be found 
in books on statistical physics. 

One of the consequences of the basic assumption IIIa is that the possible 
values of an observable that can be measured in any experiment are the 
eigenvalues? of that observable’s operator. This means that in the process of 


’ This applies to direct measurements, i.c., to measurements in which a state has been prepared 
as described in Ila. From these directly measured values one can often calculate the matrix 
elements of other operators representing physical quantities. Such observables have then been 
indirectly measured, by an experiment that prepared a mixture of eigenstates of the directly 


measured observables. 
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conjecturing the theory, the operator that represents an observable must be 
chosen in such a way that all the values obtained in the experiment that measures 
the observable in question are eigenvalues of the operator. 


11.5 The Effect of a Measurement on the State of a Quantum- 
Mechanical System 


The measurement of an observable also constitutes a preparation of the 
system. Generally, this means that the measurement of an observable 
causes alterations in the state of the system. These statements are now formu- 
lated in the second part of the basic assumption III. 


IIIb. Let asystem be in the state W, and let an observable B be measured on 
the system. Let the eigenvalues of B be b,, bz, b3,..., and let A,,, A;,,... 
be the projectors on the corresponding eigenspaces. Then the state of the 
system after the measurement is given by 


w' = ¥.A,,WA,,. (5.1) 
bj 


If this measurement is used to select a certain subensemble with a definite 
value b of B, then the state is given by 
1 
Ww’ = ——_——__A,, WA,. : 
THA, WA) ~ oe 
There is a special casein which the measurement of an observable does not 
alter the state W; this is the case if and only if the operator B that represents 
the observable commutes with W. Because only then Eq. (5.1) becomes, 
according to (4.46), 


Ww’ — } Ay, WA,, = » Ay, Ag, WwW 
om >» NeW a: 


and, since 


we get 


W' =) A,,W = W. 


Two observables are called compatible if the measurement of one does not 
alter the state that has been prepared by the measurement of the other. The 
preceding consideration then shows that compatible observables are rep- 
resented by commuting operators. Observables represented by noncom- 
muting operators are incompatible. 
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We give a brief justification of the basic assumption IIIb. The probability 
of obtaining the value b, for an observable of the system in the state W is 


<Ay,> = Tr(A,, W). (5.3) 


We know from experience that the immediate repetition of an experiment 
leads to the same results. Thus W’ must be chosen in such a way that the 
eigenvalues of B and their probabilities are the same as in the state W. The 
eigenvalues of an operator are a property of that operator; thus in order to 
fulfill this requirement we just have to demand that the probability <A,,>’ 
of obtaining b; in the second measurement be the same as the probability 
<A,,> in the first measurement: 


<A,,>’ = <A, for all b;, (5.4) 
or 
Tr(A,, W’) = Tr(A,,W) for all 5;. (5.5) 
It is easily seen that W’ given by the postulate IIIb satisfies relation (5.5). 
Tr(Ay, W’) = ¥) Tr(Ay, Ap, WA,,) 
j 


= Tr(A,, WA,,) 
= Tr(Ag, W). 


Thus the basic assumption IIIb leads to the relation (5.4), i.e., the repetition 
of the measurement gives the same result. 

We now show the converse, namely, that (5.1) follows from the assumption 
that an immediate repetition of the measurement leads to the same result—as 
expressed by Equation (5.4) or (5.5)—and the additional assumption that 
after the measurement of B, W’ must be given according to IIIa by a linear 
combination of projection operators, 


1 


ee 5.6 
Pres nl Ag oP) ©.) 


Generally w,, = Tr(W’A,,) and by (5.5) w,, = Tr(WA,,). Consequently 
under the assumption that an “immediate” repetition of the measurement 
leads to the same result [i.e., (5.5)], W’ is given by 


W’ = > Tr(WA, (5.7) 
bj 


1 
Js Gim(A;, 0) 


Let us consider the Hermitian operator A,,; WX, this operator has the 
properties 


Ay, WA,,f =0 for every f 1 A,, # (5.8) 
Ay, WA,,g = Av, Wa € A,,# for every ge A,, #. (Be) 
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Consequently it must be proportional to the projection operator A,,: 
Ay, WA,, = vjAz,- (5.10) 
From the trace of this equation we obtain the value of v;: 
Tr(vjAy,) = v; dim(A,, #7) = Tr(A,, WA,,) = Tr(A,, W); 


1 
Teen): 
Voter Re) 


Thus equation (5.10) becomes 


Ab, WA,, = Tr(WA,, Oab 


A,.=—;.—: 
) Ay, dim(A,, #) 
Inserting this into Eq. (5.7), we get 

W' = oy Ny, WA, 
bj 


which is what we wanted to derive. 


II.6 The Basic Assumptions Applied to the Harmonic Oscillator, 
and Some Historical Remarks 


We now return to the discussion of the harmonic oscillator. Let us assume 
that its state has been found to be 
aa 


where A,, is the projection operator on the space spanned by 4¢,,. This is the 
case if a measurement of the energy always gives the result E,. We now 
calculate the value we can expect to obtain if we measure the momentum P. 
According to the basic assumption II, 


<P> = Tr(PW) = Tr(PA,) 
=) (Om> PAnPn) 


= (Pn; P,), (6.1) 


so 
<P> = <n| Pn). (6.2) 


We leave it as an exercise for the reader to show that as a consequence of 
(3.26) and (3.1), 


Ce) (6.3) 
Similarly the expectation value of the position Q in the pure state W = A, is 


<Q> = <n|Q|n> = 0. (6.4) 
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We now calculate the expectation value of P? (ie., the kinetic energy P?/2y) 
and the expectation value of Q? (or the potential energy w*Q?/2): 


<P?) = (bn; P’>,) 


Using the equations (3.1), we have 


So 


(pty = HO" (4, (at — a)? 


Inserting for a and a‘ from Equation (3.26), we get 
<P?) = poh(n + 3) (6.5) 
In a similar fashion we calculate 


<Q?) = (¢,, Q7,). 
Again from (3.1), 


h 
Q? = a +a)"; 


so 
(es 0b) = 5 (ns a! + 0°64) 


and finally 


h 
<Q*> =w a (n + 3). (6.6) 


Although the expectation values of the observables P, Q, P?, and Q? are 
uniquely determined, it does not follow that one should expect to obtain 
the same value <A) for any measurement of an observable A (although such 
is the case in the measurement!° of the observable H in the pure state A,). 
What does follow—according to the definition of <A>—is that the average 
of the values obtained in a series of many repeated measurements of A on 
the state A,, is <A>, which of course is uniquely determined. In general it is 
possible that the values obtained in each measurement (a, for the first 
measurement, a, for the second, a; for the third, etc.) are entirely different 
from each other and widely dispersed. 


10 We remark that by measurement we always mean an idealized measurement in which 
the measuring apparatus does not lead to uncertainties: ie. the experimental error is zero. 
For the above measurement on CO this would mean that the bumps are considered to be 
“infinitesimally” narrow. 
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In order to describe the dispersion of the measured values quantitatively, 
let us define, for any observable A and state W, a new observable given by 


(A — <ADI)?. 
To see that this observable is suitable for the description of the dispersion 
of the measured values a,, a, a3,..., we consider the expectation value 
of (A — <A>)? in the state W, which is called the dispersion of A in the 
state W: 
dispw) A = (A — <A>w)>w 
= Tr(A2W) — 2 Tr(A< Ay W) + Tr(<A>w W) 
= Tr(4?W) — (Tr(AW))? = <A”) w — <ADw (6.7) 
(unless otherwise stated we always assume that Tr W = 1); thus 
dispyy)A = <A?>w — <ADwr- (6.8) 
For the special case of the observable H for the harmonic oscillator in the 
state A, we obtain 
disp, H = Tr(H7A,) — (Tr(HA,)) 
a (o,, H*,) aaa | ee (6.9) 
However, since @,, is the eigenvector of H with eigenvalue E,,, we have 
H’$, = HE, >, = E,H¢, 
= Enon 
thus 
disp,,,) H = E? — Ex = 0. (6.10) 


A system is said to be in an eigenstate of the observable A if its statistical 
operator is the projector on a space that consists of eigenvectors of A with 
the same eigenvalue. The above results show then that the dispersion of H 
in an eigenstate A, of H is zero. This is true in general—the dispersion of 
an observable A in an eigenstate of A is zero. Conversely, the dispersion 
of an observable 4 is zero only if the system is in an eigenstate of A. 

The energy eigenstate A, for the harmonic oscillator is not an eigenstate 
of either of the observables P and Q, so its dispersion is not zero. The disper- 
sion is easily calculated: 


dispia,,) P= <P?» i <P>?, 
From Equations (6.3) and (6.5), 
<P?) — <P>? = hon + 4). (6.11) 


Similarly 


. h 
disp,,,) 2 = <Q?> — <Q” =e (n + 3). (6.12) 
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The square root of the dispersion is called the uncertainty of A in the state 


W, and is denoted by AA: 
AA = ./disp A. (6.13) 


From Equations (6.11) and (6.12) it follows that 


AP AQ = h(n + 4). (6.14) 
Thus for any pure energy eigenstate of the harmonic oscillator, 
AP AQ > hi. (6.15) 


This relation is true in general, as we shall see later, and is a consequence of 
the Heisenberg communication relations [P, Q] = (h/i)I. It is called the 
Heisenberg uncertainty relation. 

We have seen above that in an eigenstate of an observable the dispersion 
of that observable is zero. An eigenstate is a state in which a measurement 
of the observable gives with certainty one value—the eigenvalue. In par- 
ticular, we saw that in the eigenstate A, of H the measurement of H gave the 
value E,, with certainty, 1.e., with w, = 1. 

Now consider the mixture W of the energy-loss experiment described by 
(4.17). By definition 


dispw) H = <H?) — <H)?. (6.16) 


Using (4.18) and the corresponding expression for <H >”, we have 


<H?> — <H>? = yy E;w, ars (r wE,) 


2 
=Y¥w, ¥ E2w, — (r »,E, > 0. (6.17) 
y y. a 


For the last inequality we have used the Cauchy inequality for sequences: 


90 


io (oo) 2 

en Dy; 2 (3 say with x, = ./WasYn = En/ Wy, (6-18) 
n=0 n=0 n=0 

which is a consequence of (1.2.7). Thus dispyy) H in a mixture is always greater 

than or equal to zero. It is equal to zero if 


w,—0,, Le, W=A,, 


and it deviates from zero according to the number of and values of w, that 
differ from zero. Thus disp H, and therewith AH, is indeed a quantitative 
expression for the uncertainty in the measurement of the observable H. 

It is no surprise that disp H or AH was greater than zero for the mixed 
state W of CO molecules in the collision chamber, since that was a mixture 
of CO molecules of different energies. The reason that the outcome of the 
measurement of H in W is not fully determined is that not enough prepara- 
tions had been performed to separate different systems, or more precisely, 
to separate systems in different states, from each other. 


42 Foundations of Quantum Mechanics—The Harmonic Oscillator 


The harmonic oscillators in the collision chamber can be separated, at 
least in a Gedanken experiment, into eight sets, each one being a set of N,, 
molecules having the same energy E,, i.c., such that a measurement of H 
on the nth such set gives with certainty the value E,. The state of each set is 
then A,. It is impossible to further separate (even in a Gedanken experiment) 
any of these sets into two or more subsets of different systems or systems in 
different states. This is the reason for calling A, a pure state. If we now per- 
form the measurement?! of an observable on the system in a pure state, one 
would expect from classical considerations that this should lead with 
certainty to a well-defined number, as a set in a pure state consists of only 
one species. If we choose for this observable H or N, this is indeed the case. 
However, if we choose P or Q, we see from (6.11), (6.12) that it is not. Even 
in pure states the outcome of the measurement of an observable is in general 
not uniquely determined. Thus statistics cannot in principle be eliminated 
from quantum mechanics; systems in the “same pure state” do not give 
identical values in a measurement. All that can be said is with what proba- 
bility w, a certain value a, can be expected in the measurement of an ob- 
servable. 

Let us illustrate this with a Gedanken experiment. We consider the measure- 
ment of the observable Q in the state A,, and ask for the probability with 
which we can expect the value x in a measurement of Q. By definition, the 
expectation value of an observable A is 


<A> = 2 Wii, (6.19) 


where a, are the eigenvalues of A, and w; are the probabilities for finding 
the values a, in a measurement of A. We calculate (Q> as follows: 


<Q> = Tr(QA,) =) <i@A, ID, (6.20) 


where w; = |i> is any complete basis system and we have used the following 
notation: 


(,~) =<bl¥> or i.) = ID, 
(¢, AW) = <A Al. 


For |i> we may take any complete basis system; let us therefore choose the 
system of eigenvectors of the operator Q, which we denote by |x>, where x 
is the eigenvalue of Q: 


(6.21) 


Q|x> = x|x>. (6.22) 
Then 


Q= Ox |x 
= ecu A, |x) 


=) XCM? Clee. 


'l An idealized measurement; see footnote 10. 


The Basic Assumptions Applied to the Harmonic Oscillator 43 


The last step follows from writing A, = |n><n|. 
We introduce new notation, defining 


Cie) = <xin>. (6.23) 
Then 


<Q> = ¥ xo, (x)b,() 
= yore) 7: (6.24) 


If we now compare this with the general expression (6.19) for the expectation 
value of an operator, we conclude that 


[<x|n>|? = 1b,(x)|? = wy (6.25) 


is the probability for obtaining the value x in the measurement of the ob- 
servable Q for an oscillator in the state A,,. @,(x) is called the wave function 
of the state with energy E,. Thus we have found that the probability for 
finding the value x for the position operator Q is the absolute value squared 
of the wave function. It was in this form that the probability interpretation 
was originally introduced in quantum mechanics by M. Born [influenced by 
an earlier suggestion of Albert Einstein (1916)]. 

As we shall discuss later, the }’, is in fact an integral, and w, = |<x|n)> F 
is a quantity of dimension cm“ ', ie., a probability density. The probability 
is the integral of w, over a certain interval. 

We have derived the fact that |@(x)|* describes the probability of finding 
the physical system at x from the basic assumption II. Historically the events 
took place in reverse order, and II is a generalization of the discovery by 
Max Born in June 1926 that the wave function (x), which —as we shall see 
in Section II.8—is a solution of the Schrédinger equation, is connected 
with probability. 

Born was at that time investigating the collision process between free 
particles and an atom, using the formalism of Schrodinger’s wave mechanics, 
which for this kind of process he found superior to the matrix mechanics 
of Heisenberg, Born, and Jordan. However, he could not accept Schrodinger’s 
interpretation that @(x) represents matter waves. Recalling these events, 
Born said in his Nobel Prize Lecture in 1954: “On this point [Schrodinger’s 
matter-wave interpretation] I could not follow him. This was connected 
with the fact that my Institute and that of James Franck were housed in the 
same building in Géttingen University. Every experiment by Franck and 
his assistants on electron collision appeared to me as a new proof of the 
corpuscular nature of the electron.” He calculated the formula for the 
scattered wave in the Schrodinger formalism (Born approximation'’) and 
concluded that if this formula is to allow for a corpuscular interpretation, 
then the most natural interpretation of (x), is that |(x)|? describes the 


12 See Section XIV.5 below. 
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probability density for detecting the corpuscle. He summarized his con- 
clusion in 1926: “The motion of particles conforms to the law of probability 
but the probability itself is propagated in accordance with causality.” 

Born’s interpretation was soon generalized to include other observables 
in addition to the position. And the time was ripe for the general formulation 
of quantum mechanics. The starting point for this was the discovery that 
the canonical transformations can be carried over to matrix mechanics and 
wave mechanics (F. London, May 1926; P. Jordan, July 1926). In this way 
the canonical transformations were recognized as transformations of an 
abstract space. Transformations of objects with continuous indices were 
introduced by Dirac and Jordan in December 1926. The mathematical 
elaboration of the London—Jordan-Dirac transformation theory was under- 
taken in Gottingen by David Hilbert together with L. Nordheim and John 
von Neumann (1926-1927). Notions like state and observable evolved. 
Subsequently, von Neumann developed an axiomatic theory of Hilbert 
spaces and established the association between physical states and Hilbert- 
space vectors as well as the correspondence between observables and linear 
operators. He also introduced the statistical operator. 

Parallel to the mathematical formulation went the development of the 
physical interpretation, which was discussed principally in Copenhagen 
(1926-1927). Whereas Bohr wanted to make the wave-particle duality the 
starting point of the physical interpretation, Heisenberg relied upon the 
transformation theory. His basic approach was to regard something as 
defined if it is measureable. In his famous Gedanken experiment in which 
he discussed the position measurement of the electron with a microscope, 
he explained the uncertainty of position and momentum (6.15), and there- 
with the probabilistic nature of quantum-mechanical predictions, as the 
effect of the measurement apparatus. Object and observer have no in- 
dependent reality; the measurement affects the state of the object (basic 
assumption ITI). : 

While Heisenberg derived his uncertainty principle from the transforma- 
tion theory, Bohr developed his conception of complementarity from the 
wave-particle duality (see Section II.11): There exist complementary prop- 
erties—like position and momentum—and the exact measurement of one pre- 
cludes the possibility of obtaining information of the other. Properties are 
not actualities; they are only possibilities for the physical system. These 
developments formed the basis of the so-called Copenhagen interpretation 
of quantum mechanics.'* Its physical content is formulated in the basic 
assumptions of this chapter. 


‘3 The interpretation of quantum mechanics suggested by Born and established by Heisen- 
berg and the Copenhagen school represented such a drastic departure from the well-established 
ideas of their time that decades had to pass before it could gain general acceptance, and even 
today new attempts are being undertaken to return to a deterministic interpretation. Born’s 
statistical interpretation of quantum mechanics was one of the most far-reaching contributions 
of physics to contemporary thinking. 
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Von Neumann’s Hilbert-space formulation could not accommodate 
continuous eigenvalues. Since that time, stimulated by the Dirac formalism, 
a new branch of mathematics—the theory of distributions—was developed 
and for the first time systematically presented by L. Schwartz (1950-1951). 
On this basis I. M. Gelfand and collaborators introduced around 1960 the 


rigged Hilbert space. Its use for quantum mechanics was suggested around 
1965. 


II.7 Some General Consequences of the Basic Assumptions of 
Quantum Mechanics 


After this detailed discussion of the case of the harmonic oscillator, let us 
investigate some general consequences of the basic assumptions of quantum 
mechanics. Let us consider an arbitrary state W of an arbitrary micro- 
physical system and see what we can predict for the measurement of the 
values of some arbitrary Hermitian observables A, B,... of this system. 
These predictions are contained in the following two statements, which we 
will prove below. 


1. The dispersion of any observable A in an arbitrary state W fulfills 
disp) A= 0 (7.1) 
with 
dispiyw, A = 0 
if and only if W is an eigenstate of A. 


2. The uncertainties of two observables A, B in an arbitrary state W 
fulfill the uncertainty relation 


AA AB > 3|<[A, B}>wl. (72) 


A special case of this uncertainty relation is the Heisenberg uncertainty 
relation; from 


h h h 
one i and (hi) =*tewi =" 
i 


1 
it follows that 


AP AQ > = (7.3) 


for any state W. The equality sign in the Heisenberg uncertainty relation 
holds only if W = Ao, the ground state of the one-dimensional harmonic 
oscillator. 

The uncertainty relation is a very general property of two operators, and 
just depends upon the assumption that the observables are represented by 
linear noncommuting operators in a linear space. It says that unless A and 
B are compatible observables, there are always states for which at least one 
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of the observables cannot be measured with exact accuracy (since for every 
operator C = i[A, B] there is always a W such that 


KO>w| = |[Tr(WC)| # 0). 


In the case that C (or —C) is positive definite (as, e.g., for A = Pand B = Q), 
|Tr(WC)| > 0 for every state W, and therefore the two observables A and 
B can never be measured simultaneously with unrestricted accuracy. If the 
system has been prepared in an eigenstate of one of the observables A, 
AA = 0, (i.e., A has been measured to give accurately one value), then 
according to (7.2) the values for B are completely uncertain. This is a con- 
sequence of the previously discussed fact that in quantum physics a measure- 
ment changes the state, and the measurement of one observable interferes 
with the measurement of another observable unless these two observables 
are compatible, i.e., are represented by operators that commute with each 
other. Thus only compatible observables can be simultaneously measured, 
and only for compatible observables do there exist common eigenstates. 


PROOF OF STATEMENT |. For every linear operator B and every positive definite W 
(for which the following operations are defined), 


Tr(WB'B) = > (9,, WB'B@,) = > (Bto,, WB"g,) = 0, (7.4) 
T ? 


because ((B*@,), W(B'g,)) = 0 for every (B'¢,), as W is positive definite. For 


B=A-—al=B' witha=<(A>=TrWA G3) 
it follows from the definition (6.7) of disp A that 
dispw) A = Tr(W(A — al)’) = 0. (7.1') 
Assume that 
disp 4 = (BB) — 0) for B— 2 (7.6) 
then 
os (y,, WBBg;) = ee b*(y, Wo) = 0, (7.7) 


where the basis @f is chosen as in (4.35)-(4.37). As W is positive definite, all terms in 
the sum are > 0, and it follows that 


b?(ok, Wok) = 0 for every b, k. 
As W # 0, there must be at least one 3. for which 
(g5, Wo,) = 0. (7.8) 


For this pt.. b'? = 0, Le, b’ = 0. 
Let us assume that there is only one such b’. 
If we choose (7.5) for B, then y} is also an eigenvector of A and 


(A — al)of = bo; = (a — a)Qj. (7.9) 
The eigenvalue a’ that corresponds to the eigenvalue bh’ of (7.8) must therefore fulfill 


2 or ea (7.10) 
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From this it follows that 


1 


W = A,=——., 
ie fave 


the eigenstate of A with eigenvalue a’. Let us now assume that there is more than one 
value b’ that fulfills (7.8), say b, and b, # b,. Then there must be two corresponding 
value a # a’, that fulfill 


a, = Tr(WA) and a, = Tr(WA), Galt) 


which is impossible. 
Thus if (7.6) is fulfilled, then 


1 


Tae aie 
ele 


(7,12) 
where A,, is any of the eigenstates of 4. That disp A = 0 for W = A, an eigenstate of 
A follows immediately from (7.7). Therefore statement | is proven. Oo 
PROOF OF STATEMENT 2. For two operators A and B we define 

h(A, B) = Tr WA'B = Tr(WI1(A'B)) = hU, A'B) = (A'B). (7.13) 
One easily shows from (4.3) and (4.4) that 


(a) h(A, A) =0 (which is done in (7.4)), 
(b) ACA, B) = h(B, A), 

(c) h(aA, B) = ah(A, B), 

(d) h(A + C, B) = H(A, B) + H(C, B), 


i.e., h(A, B) defined by (7.13) fulfills all the conditions of (1.2.5) that define a positive 
Hermitian form. Since h(A, B) is a positive Hermitian form, the Schwarz inequality 
(1.2.7) holds: 


|h(A, B)|? < A(A, A)h(B, B). (7.14) 
From this, statement 2 is easily proven (we assume A, B to be Hermitian): 
|<[A, B]) |? = |hU, [A, BY)? = |ACA, B) — ACB, A)? < 4h(A, A)ACB, B) 
so if we replace 
A>A-—al, a = Tr(WA) 
B>B-—fI,  f =Tr(WB), 
we obtain 
h(A — al, A — a1h(B — BI, B — BI) > 4|hU, [A — al, B — BI))|’ 
of (for A and B Hermitian) 
disp A- disp B > 4|h(1, [A, B])|? = 41<CA, Be (7.15) 


The square root of this gives (7.2), and therefore statement 2 has been proven. ia 
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11.8 Eigenvectors of Position and Momentum Operators; the 
Wave Functions of the Harmonic Oscillator 


In this section we shall investigate the properties of the wave functions. 
We will begin by examining the properties of the operators P and Q; in 
particular, we wish to know what are the possible eigenvalues x of Q. That 
is, for which values x can 


Q|x) = x|x) (8.1) 


be fulfilled? Then we wish to find the transition coefficient (x|n) which 
transforms between a vector |n) = @, (an eigenstate of H and N) and the 
vectors |x) (eigenstates of Q): 


|n) =) |x)In). (8.2) 


x 


From (3.1) and (3.26) it follows that 


ie 
Q|n) = a + a’)|n) 


~ [Jain = 1) + Jn + Tin + D) (8.3) 


Taking the scalar product of this equation with |x), we obtain 


h 
(n|Q|x) = [7a5 mtn — 1|x) + ./n + 11 + 1|x)). (8.4) 
On the other hand, taking the scalar product of (8.1) with |n), we obtain 


(n|Q|x) = x(n|x). (8.5) 
Comparing (8.4) with @.5) yields 


x(n|x) = Jin — 1|x) + /n + 1(n + 1|x)), (8.6) 


or, denoting n + 1 = m, 


\/m(m|x) = “0? xm — 1|x) — ./m — 1(m — 2|x). (8.7) 


Since (8.3) is valid for n = 1,2,..., ($7) as; valid forces) 3, (2% For 
n = 0 (i.e., m = 1), we obtain instead of (8.3) 


ji h 
Q|0) = ae ee 10+ 1)= lag |Ds (8.3a) 


and instead of (8.7), 
2 
SIX) = |= xl) (8.7a) 
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Thus we see that (8.7) is a recurrence relation for (m|x); if (0|x) is known, 
we can determine (1|x) by (8.7a) and then determine (2|x) by (8.7). With 
(1|x) and (2|x) we can determine (3|x) by (8.7), and so on. 

To find out what the transition coefficients (m|x) are, we introduce 


per tt |) 
Ia) =A 2a lx)’ (8.9) 


which is defined for all x with (0|x) 4 0 (if (0|x) = 0, then by (8.7) and 
(8.7a) (n|x) = 0 for all n). Then from (8.7) it follows that 


m 2 m-—1 
[ Fr Sly) = [5-1 ¢q — yi m1) = [5"= Fm — ai Im 20), 


and 


(8.10) 
or 
Sal ¥) = 2yfm—1(y) — 2(m — 1) fin- 2). (8.11) 
From (8.7a) we have 
fi) = 2y fo), (8.12) 
and from (8.9) 
foly) = 1. (8.13) 


Equations (8.11), (8.12), and (8.13) are the recurrence relations for the 
Hermite functions and have solutions for any complex number y.'* Thus 
for any complex value x there is a solution (n|x) of the recurrence relation 
(8.7). For physical reasons we need to consider only real values of x, because 
x is the value that is obtained in the position measurement and must be real. 
[Mathematically, nonreal values of x can be excluded by the requirement 
that (n|x) be square-integrable.] For real values of y, the solutions f,,(y) of 
(8.11), (8.12), (8.13) are the Hermite polynomials: 
Ly fe 
fy”) = Hy) = (—1)"e” a (8.14) 


Thus from (8.9) we may obtain the transition coefficient (n| x) for every real 
value of x for which (0|x) is defined. We restrict ourselves to those solutions 
of (8.7) for which (0|x) is finite, because \(O|x)|?, the probability for ob- 


taining the value x in a measurement of Q in the ground state Ao, is assumed 
to be finite. 


14 NN. Lebedev, Special Functions and Their Applications, Prentice Hall (1965) Section 10.4. 
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Combining (8.8), (8.9), and (8.14), we have 


1 
(n|x) = aq Obom( [4 x) (8.15) 


for —0o0 <x < +00. 


[The set (|x) = ¥, (bn |x), considered for a fixed value x as a 
function of wy, is an antilinear functional over the space ® of the y 
in Equation (3.27) [i.e., it fulfills Equations (I.2.8)]. Thus |x) is a 
functional, and one can show that for real values of x, |x) is even a 
continuous functional [Equation (I.2.9)]. As was mentioned in 
Chapter I, the set of continuous linear functionals on a space , 
which we denote by ®%, is in general larger than the space ®, except 
for the Hilbert space #, where #* = # (Frechet-Riesz theorem.) 
Taking into account (3.28), we conclude ® < # c< ®%. This triplet 
of spaces is called the Gelfand triplet or rigged Hilbert space. The |x) 
that obey (8.1) are elements of ®* and are called generalized eigen- 
vectors of the operator Q; the set 


|x) —-oO<x< +0 


is called a generalized basis. The set of real eigenvalues of a Hermitian 
operator we call its spectrum.* The spectrum of the operator N, 
(2.31), or H, (2.37a), is called discrete, because it is a discrete set; 
the spectrum of the operator Q is called continuous, because it is a 
continuous set. As x can be any real value, the sum in (8.2) should 
actually by an integral. A famous theorem, the nuclear spectral 
theorem, states that every y € ® can be written as the continuous 
form of (8.2), i.e., 


o= fax Bone fax Ix>o(x). (8.16) 
Here we denote the generalized eigenvectors of Q, 
Q|x> = x|x), (8.17) 


by |x> instead of |x) to emphasize the difference between them and 
the eigenvectors with discrete spectrum, |n). If we take the “scalar 
product” of @ in (8.16) with |x’>, or more precisely the functional 
|x’> at the element @, we have 


Bi i Pee Ab eaten (8.18) 


<x’|x> is therefore that mathematical object which by integration 
(8.18) maps the “well-behaved” function g(x) = <x|@)> into its 
value at the point x’: g(x’) = <x’|@>. One can prove that such an 


* This is not always identical with the precise definition of the spectrum of a selfadjoint 
operator in the Hilbert space. 
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object cannot be a locally integrable function; it is called a gen- 
eralized functions or distribution. The particular distribution <x’ |x> 
is called the Dirac 6-function and is often denoted 


x i> = 0x = x), (8.19) 
so (8.18) is written 


o(x) = <x'|@) = | dx 8x! — x)<x|9) 


= fax O(x’ — x)ep(x). (8.20) 
In the transition from the symbolic (8.2) to (8.16), 


¥ |x)(x| was replaced by fax Ix> <x]. 


Consequently, the states |x) and the generalized states |x> must 
have different dimensions in order that 


lye and dx |x><x| 


‘may have the same dimension. 

The eigenvectors are always chosen so that they can be normalized 
to 1, i.e., they are chosen to be dimensionless. This is expressed by 
the normalization of a system of eigenvectors of an operator with a 
discrete spectrum: 

1 forn=rn', 


\ = - — Al 
(n|n') = Say, Where Onn ‘0 en (8.21) 


The generalized eigenvectors |x>, on the other hand, do not have 


dimension 1 but rather the inverse of ./dim dx. For example, if dx 
has the dimension cm, then <x’|x> has dimension cm~ 1 The 
generalized eigenvectors are normalized in the sense of (8.19) and 
the 6-function has the same dimension as x’. Instead of generalized 
vectors with the 6-function normalization (8.19), we could also 
choose generalized eigenvectors |x}, with a different normalization. 
Instead of (8.16) we could write 


9 = [du(x) Ix}y (x19) (8.22) 
with the “normalization” 
aX’ |x}, = g(x)d(x' — x), (8.23) 
where 
du(x) g(x) = dx (8.24) 


The most convenient choice of g depends upon the property of the 
operator. For the operator Q, g(x) = 1 is a suitable choice; we will 
therefore use the generalized eigenvectors |x >. 
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Equation (8.17) is the analogue, for the operator with continuous 
spectrum, of (3.20) for the operator N (or H) with discrete spectrum. 
As ¢ in (8.16) is an arbitrary vector, we can omit it and obtain the 
analogue of (4.9): 


[= fax Perea e (8.25) 


If we apply the operator Q to the arbitrary vector in (8.16) we 
obtain . 


oo [ax x1 <xlo0), 


or, if we omit the arbitrary @, we obtain the continuous analogue 
of (4.42): 


Q= fax oS OCA (8.26) 


This is called the spectral representation of the operator Q. It is the 
analogue of (4.42), which was the form for operators with discrete 
spectrum. The |x» <x| are the analogues of the projection operators; 
however, they do not project on any vector of the Hilbert space 7 
(or ®), because the |x) are not elements of these spaces. Projection 
operators are 12 dx |x><x| where the integration extends over a 
finite interval Ax. | 


We now return to the evaluation of the harmonic-oscillator function and 
calculate (0|x>. We write ¢, = |n) in the form (8.16), 


* In) = [ax txy<etn) (8.16) 


and take the scalar product of this with @,,, 


bmn = (Pm> Pn) = (min) = fax (m|x><x|n). (8.27) 


We shall make use of 


<x|n) = (n|x). (8.28) 


[If|n> = ¢, and |x> = x both were vectors in the Hilbert space then 
(8.28) would be the condition (1.2.2b) of the definition of the scalar 
product. However (n|x> is the functional |x> € D* at the element 
|n)€@® and <x|n) is a functional |n)e®*~ = ® at the element 
|x> € ®*, so (8.28) is not obvious. But it can be shown that (8.28) is 
generally true for transition coefficients between a discrete basis and 
a continuous basis. Hereafter we shall drop the typographical 
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distinction | ) as opposed to | >, and shall use generally >, unless we 
we want to stress the distinction between generalized and ordinary 
vectors. | 


We insert (8.15) into (8.27) and obtain 


Jarno J 8m? 


If we compare this with the orthogonality relations for the Hermite poly- 
nomials, ‘> 


2 


Ai(y)H(y) = Om. —«(8.29) 


[ay eM Ha()#A0) = ° mabaiha 
n!2"\/n form =n, 


h ms HO _ ya 
(ol fay) - [ee : (8.31) 


Thus, up to an arbitrary phase factor (which we choose to be real), 


(8.30) 


we find 


1/4 
Gi = (= eo (ueo/2R)x? (8.32) 


With this and (8.15) we obtain the transition coefficients <n|x> between 
the x- and n-bases, i.e., the harmonic-oscillator wave functions @,(x): 


| po eee 
P(X) = <n|x> = (‘<" tal xe ONE Ue 3) 


A few of the wave functions are plotted in Figure 8.1. 
We now repeat the procedure for the operator P that we have already 
gone through for the operator Q. The eigenvectors of P will be denoted 


by |p>: 


P\|p> = pip? (8.34) 
The action of P on |n> (calculated in the problems for Chapter IV) is 


y Pee (a — at)\n> 


_ ree (Jaln = 1) — Jn Un + 19) (8.35) 


P|n> 


15 |_ebedev, ibid., Section 4.13. 
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Pulx) 


pat, 


we {H nan on 


Figure 8.1 Wave functions of the harmonic oscillator [from Ludwig 
(1954), with permission]. 


If we take the “scalar product” with |p> and use (8.34), we obtain 


p<p|n> = —i Pee (Jaceln —1)-—Vn+ l<pin+1)), (8.36) 


or 


p<n|p> = ye (/nkn — 1]p> — /n + In + Ip). (8.37) 
If we introduce the new quantities (n|p) defined by 
<n|p> = i"(n|p), (8.38) 
then 
i<n — 1|p> = ii” *(n — 1 |p), 
—i<n + 1|p> = —ii***(n + 1p), 
so (8.37) may be written 


h 
p(n|p) = (/n(n — 1|p) + /n+1(n+1|p)). (8.40) 


We see that this is exactly the same recurrence relation as in (8.6), with 


X,/ p/h replaced by p/,/uwh. Thus by the same argument as for <n|x>, we 
find [using (8.38) ] that 


(8.39) 


<n|p> = r( (8.41) 


Li al i 
H,( —= re p?) 2noh 

moh pon! J hu 

We have seen that the eigenvectors |n) of the energy operator H for the 
harmonic oscillator have the very particular property that the transition 
coefficients (8.33) between these vectors and the x-basis are the same as the 
transition coefficients (8.41) between these vectors and the p-basis except 
for a phase factor. 
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By the same argument as above for the operator Q, we conclude that the 
spectrum of P is continuous, 


spectrum P = {p| —0o < p < o0}, (8.42) 


and the | p> are generalized eigenvectors. 
The transition coefficients <p|n)> in 


In> = | dp p> <pin> (8.43) 


are called the wave functions in the momentum representation and are denoted 
bp) = <p|n>. (8.44) 


The probability of finding the value p in a measurement of the momentum P 
in the energy eigenstate A, of the oscillator is w,(p) = |<p|n>|?. Also, for 
any arbitrary vector ¢ the transition coefficient 


op) = <ple> (8.45) 

in 
9 = [aplp><pie> (8.46) 
is called the momentum-space wave function or the wave function in the p- 


representation for the state 9. 
So far, we have obtained the matrix element of Q in the x-representation, 


<n|Q|x> = x<n|x); (8.47) 

or more generally, for every well-behaved vector 9, 
<p|Q|x>=x<o|x> — <x|Q1 > = x<xl). (8.48) 
Likewise, we have obtained the matrix element of P in the p-representation, 
<n|P\|p> = p<n|p>; (8.49) 


or more generally, 
<e|Pip> = p<elp> — <P P1@> = p<pl@>. (8.50) 


We now want to calculate the matrix elements of P in the basis of 
generalized eigenvectors of Q and the matrix elements of Q in the basis of 
generalized eigenvectors of P. We do this in two steps: First we find out 
what the symbol <x|p>, the transition matrix element between the x-basis 
and the p-basis, represents, and then we determine <x|P|n>. 

We begin by taking the scalar product of (8.16’) with |p> [or, more pre- 
cisely, we will consider ¢, as a functional at the generalized eigenvector 
| p> € ®*, p € spectrum P, and use (8.16’)]: 


<piny = | dx <pix><xin): (8.51) 
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and then we take the scalar product of (8.43) with <x| [or more precisely, 


consider ¢,, as a functional at the generalized eigenvector 
|x> € ®*, x € spectrum Q, 


and use (8.43)]: 


xin = [dp <xIp><pin), 


(8.52) 


In (8.51) and (8.52), <x|n> and <p|n> are given by (8.33) and (8.41), re- 


spectively. The Hermite polynomials have the following property'®: 


+ ein ; 
ie" /2H (n) = { ds Fee SPH 
a Tt 


Inserting (8.41) and (8.33) into this relation, it follows that 


eixplh 


Gin i Ca ale 


or, taking the complex conjugate, 


—ixp/h 


pinay = Ja erin. 


Comparing (8.55) with (8.51), we find that the ¢p|x> are given by 


1 . 
<p|x> = ah eer 


In the same way one obtains from (8.52) and (8.53) 


eixplh 
«x|p> = a ,e 
(8.56) and (8.57) together give 


<x|p> = <p|x>. 


(8.53) 


(8.54) 


(8.55) 


(8.56) 


(8.57) 


(8.58) 


[| We emphasize that (8.58) is not trivial, because <x |p) is not a scalar 
product and thus does not follow from the “Hermiticity property 
of the ‘scalar product’ <x|p>.” It is a very particular property of 
the operators P and Q, and is not true for the general case of transition 
coefficients between two arbitrary systems of generalized eigen- 


vectors. | 


'© Lebedev, ibid.. Section 4.12. 
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It is now simple to calculate the matrix element of P in the basis of gener- 
alized eigenvectors of Q, using (8.57): 


eixplh 


<xlPI9> = | dp p<xIp><plo> = feo p= <nlo) 
1 


hd hd 
= [ar=< <xip><loy =-= [arcelpy<ple>. 


Thus 


hd 
<x|P|@> = -~— <x|@>. (8.59) 
i dx 


In the same way, using (8.56), one obtains 


hd 
<piQlo> = —-~— <pl@>. (8.60) 
i dp 


Each vector in the linear space can be fully characterized by its components 
with respect to a basis system. In the same way as one represents the vector 
x in the 3-dimensional space by its component x; with respect to the three 
basis vectors e;, according to x = a €;X;, one can represent the vector 
@ by its component <n|@> with respect to the discrete basis $, = |n> of 
eigenvectors of H according to g = Vireo In><n|@> [Equation (4.8)]. This 
is also true if the basis is a continuous basis, according to 


9 = faxix><xlo). (8.16) 


Thus to the vector g corresponds the function <x|g> = (x) and to the 
vector Pg corresponds the function <x|P|g>. Equation (8.59) then states 
that in the realization of the space of vectors @ by the space of wave functions 
<x|@> = g(x), the momentum operator is realized by the differential 
operator times h/i, and (8.48) states that the position operator is realized 
by the operator of multiplication with the number x. This realization is 
called the Schrédinger representation. 


[It should be noted that the Schrodinger representation could not 
be derived from the commutation relation (2.1a) alone, but required 
an additional assumption: that the operator H in (2.1b) has at 
least one proper eigenvector. This assumption is one of several 
equivalent forms of the additional requirement. Other forms are that 
P? + Q? be essentially self adjoint or that the representation of the 
Heisenberg algebra be integrable to the representation of the Weyl 
group (Weyl’s form of the canonical commutation relation). | 


The Schrédinger representation of the energy eigenvalue equation 


Hos Eo, (8.61) 
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is the time-independent Schrédinger equation. The energy operator for the 
oscillator was given by (2.1b): 


2 
po? 


1 
H = — P? + —— Q’. 8.62 
an oe Q (8.62) 
Let us take the matrix element of H between <x| and |n): 
il 2 
(xl In) = 5° <1 PPI) + Cx1Q? In). (8.63) 
We compute 
<x|P?|n> = ¥ <x|Pln'><n'| P| n> 
hd 
= ae <x|n’><n'|P|n> 
hd 
=-—)) «x|n’><n'|P\n> 
Wee 
hd 
= —— <x|P 
AoE <x|P|n> 
ee 
=e (8.64) 
(*) ax? ae 
Moreover, from (8.1), 
0 |p — x ln (8.65) 


Inserting these in (8.63), we have for the matrix element of the energy oper- 
ator bg 
gc 100? 
{x|H|n> = — ude <x|n> + — x?<x|n> 
h? @® yw? ‘ 
= (a+ <x|n>. (8.66) 


Summarizing, we have seen that in the x-representation, the operator P 
is given by —ih d/dx, 
au 
Pi ite, (8.67) 
dx 


and the energy operator by 
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II.9 Comparison between Quantum and Classical Harmonic 
Oscillators 


As stated above, the wave functions <x|n> and <p|n> give the probability 
densities w,(x) = |<x|n>|? and w,(p) = |<p|n>|* for finding the value x ina 
position measurement and the value p in a momentum measurement after 
the energy has been measured with the result E, = hw(n + 4). For physical 
reasons it is impossible to measure the position exactly at one point x. In 
order to do this, we would have to build an apparatus (counter) that registers 
when the particle is exactly at the point x. But this is impossible, because 
such an apparatus must have finite size. Thus the physical question to ask is, 
what is the probability w,(x — ¢€, x + ©) of finding the position in the interval 
between x — € and x + ¢€ after an energy measurement had given the result 
E,,? This probability is given by 


x+e 
w(x -6x+60= { dx’ w,{(x’) 
x—e€ 


X+€ 
= | dx' |<x|n>|?. (9.1) 
Let us consider the specific cases n = 0 andn = 1: 
(x|n = 0) =) __ ¢- #72, (9.2) 
Xo z 

1 axe x 
<x|n = 1) = ———— e *"**2_, (9.3) 

V%0./q Xo 


where x, = ./h/uo. Then the probability of measuring the position in an 
interval Ax around the value x is given for small Ax by 


3 A 1 =P 
vol _ ns x+ 3) = Wo(x) Ax = =e 0 Ax (9.4) 


2 Xor/t 
and 
Ax Ax De Ax aes 
vir a+ SE) = wiln) Ax = ae 


Figure 9.1 shows wo(x) = Wam, and Figure 9.2 shows w,(x) = Wgm- In order 
to compare the quantum-mechanical results with the classical ones, w,)(x) 
is also shown in the figures, where w,,(x) Ax is the probability for finding 
the particle of the classical oscillator in the interval [x, x + Ax] when its 
energy is Ey or E;. 
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0 


Figure 9.1 Classical and quantum probability densities for the state of 
an oscillator with the lowest energy Ey. 


A simple comparison with the classical situation may be seen in the 
frequently used diagrams that display both the energy levels and the poten- 
tial-energy function. For the potential energy 


U(x) = 4um?x? (9.5) 


of the oscillator this is shown in Figure 9.3. Let us consider, for example, the 
level E,. According to classical mechanics a particle with energy E, could 
be found only in the range AB, since A and B are the turning points, ice., 
the points at which the potential energy is equal to the total energy. At these 
points the kinetic energy T is zero, since 


E=T+U ie — U- (9.6) 


Evidently OA = OB is the amplitude of oscillation of a particle having 
energy E,. 

The classical probability distribution is found in the following way: The 
period of oscillation is°t = 22/@. The probability of finding the particle 


Figure 9.2 Comparison of the quantum position probability density 
Wam Of a particle (for n = 1) with the classical density w,,. A, B: turning 
points; A’, B’: points of maximum wen. 
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U(x) 


n=2 E, = Sha/2 


E, = 3he/2 


1 
1 
i] 
i] 
: 
| Ey = heo/2 
1 
1 
A 0 pe 


Figure 9.3. Diagram of quantum levels E,, and potential energy 
U(x) = 4um*x? for a harmonic oscillator. 


inside Ax is given by the fraction of time At/t that the particle stays inside 
the interval Ax: 


At @ Ax 
W.(x) Ax = — = — —_—, 9.7 
(x) ar (9.7) 
where v is the velocity. Since 
x =a sin wt, 
where a = \/2E/y)? is the amplitude of oscillation, it follows that 
v = aw COs ot. 
If we express the velocity as a function of x, we obtain 
2\ 1/2 
bx) = ao(1 — a ; 
a 
and inserting this in the above gives 
A 9.8 
w(x) Ax = one (= x2/a?)'/? e (9.8) 


for —a < x < a, and zero otherwise. 

w.,(x) is shown in Figure 9.2; as we would expect, the greatest probability 
occurs at the turning points A and B. Comparing this with the quantum 
probability density w,,, in the same figure, we see that the latter has maxima 
closer to the equilibrium point (to be precise, for E, = 3hw/2,OA = onR= 
./3h/uo and OA' = OB = ,/h/po), but unlike the classical case, the 
probability of finding the particle is not zero even beyond the turning points. 
If w(x) were different from zero for x > a= ./2E/po, this would mean 
that the potential energy U = (uw?/2)x? > E, the total energy. However in 
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quantum physics Q, and therefore Q? and U, are not simultaneously measur- 
able with H: after Q has been measured the system is no longer in a state 
with energy value E. Thus, no contradiction arises from wgm(x) > 0 for 
eee g! 

Let us consider the lowest energy state of the oscillator, where the difference 
between the quantum and classical cases is particularly striking. In the 
classical theory the minimum energy of the oscillator is E = 0, corresponding 
to a particle at rest at the equilibrium position. The position probability 
density w,,(x) associated with this state has the form depicted in Figure 9.1, 
being everywhere equal to zero except at the point x = 0. In quantum 
theory the minimum energy of the oscillator is 


Ey = tia, (9.9) 


which is known as the zero-point energy. The position probability density 
Wam(X) associated with the state of minimum energy has its maximum at 
the point x = 0, but is not equal to zero everywhere else. 

We shall now show that these nonclassical phenomena are direct con- 
sequences of the fact that momentum and position in quantum mechanics 
are not simultaneously measurable. 

If we take the expectation value of the energy relation (2.1b) for the oscil- 
lator, we obtain 


CED <P?) a "<0 (9.10) 


We have the uncertainty relation 


2 


<P?) <Q?) = disp(P*) disp(Q*) > (0.11) 


because, according to (6.3) and (6.4), <P> = <Q» = 0 for every state of the 
oscillator. Thus, because of the uncertainty relation (9.11), the expectation 
value of the energy <H> cannot be zero. If we insert (9.11) into (9.10) we 
obtain 


poh? 1 
ERG a 


<H) > = <P?) + (9.12) 


To obtain the smallest expectation value for the energy, we find the minimum 
of the equality part of (9.12). The condition d¢H>/d<P*> = 0 gives 
CH min = 2 hoo. (9.13) 


Thus the ground state of the oscillator is in fact the state with lowest ex- 
pectation value of the energy, and the eigenvalue E, in this state is the lowest 
energy compatible with the uncertainty relation. 
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11.10 Basic Assumptions IT and III for Observables with 
Continuous Spectra 


The description of the basic assumptions II and III in Sections 11.4 and II.5 
was given in terms of operators with discrete spectra. In this section we shall 
give the description for the case that the observables have continuous 
spectra. With the help of the formalism introduced here we shall be able to 
discuss in more detail the problems connected with the measurement of 
position and momentum, the subject of the next section. 

The statistical operator of a pure state, 


W = Ay, = |bo><bol, 


where A,, is the projection operator onto the one-dimensional subspace 
spanned by the (proper) eigenvector |b,» of an observable B, is written in an 
arbitrary basis system {@,,} as 


W =) cr |Pm><Pnl> (10.1a) 


where 
Cn = <Py| Do». (10.1b) 


If {¢,,} is the basis system {@, = |b} in which B and therefore W is diagonal, 
then 


Ch = <4|bo> = <b|bo> = Onno: (10.1b’) 
For a mixture the statistical operator is 
W= 3 Wmn| Pm? <Prls (10.2) 


where w,,, are the matrix elements of W in the basis {@,}; the collection 
{Wn} 1S the density matrix. 
In the case of a continuous spectrum Equation (10.1a) has to be replaced 


by 
ile [ax ax FOF koe Ie (10.3a) 


where {|x>} is the basis system of generalized eigenvectors of an observable, 
say Q, with a continuous spectrum. The (well-behaved test) function f(x) 
of the continuous variable x for the pure state W = A,, is given by 


f(x) = <x|bo> = <x bo). (10.3b) 


Here we have used the second notation to emphasize that |x is a generalized 
eigenvector and |b) is a proper eigenvector. For a mixture the statistical 


operator is in analogy to (10.2): 


Ne [ax ax Hire ES x5 (10.4) 
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where F(x’, x”) is a function of the two continuous variables x’ and x”, the 
generalized eigenvalues of Q. 

Let us now assume that the observable B that was used to prepare the 
state W is Q, and that we want to prepare and describe a pure state. Naively 
one would want to choose 


FG) = Glko = 0% — Xo); 
in analogy to the discrete case (10.1b’), so that 
W =A,,, = |X0><Xol. 


However, it does not make sense to ask for a measurement at a particular 
point of an observable with a continuous spectrum. Any measurement 
apparatus, a macroscopic system, is of finite size, not infinitesimally small. 
Counters have finite extension and need a finite amount of energy; slits—as 
narrow as they may be—have finite extension. Therefore one cannot ask 
for a state of a physical system in which the measurement of an observable 
Q with a continuous spectrum [say spectrum Q = (—x,+%)] gives 
exactly one particular value, because such a state can be neither measured 
nor prepared. It only makes sense to ask for a state of a physical system in 
which the measurement of the observable Q gives values x lying in some 
particular interval xp — € < x < x9) + € around a mathematically defined 
point xg. (Physically neither an exact point nor an exact interval, 1e., an 
interval with precisely determined endpoints, can be defined.) Thus for 
physical reasons, the analogue to (10.1a) for an observable Q with continuous 
spectrum is 


W= [av dx! Lexie sale oe (10.5) 


where f(x, X9) is a function that has its main support in the neighborhood 
(x9 — 6X + ©) of the point x and describes in a more or less direct way 
the resolution of the nfeasurement apparatus by which the state was pre- 
pared. 

For the state W given by (10.1a), the probability that a measurement of B 
will yield the value b is 


Tr(A, W) = (b| W[b) = (b| bo) (bo |b) = Onbo Obob = Obbo: (10.6) 


For the state W given by (10.5), the likelihood that a measurement of Q 
will yield the value x is described not by a probability but by the probability 
distribution (probability density) 


Kx |x? 


[dx dy fxs x0)f 200s Xo) Cela") Cx" |X> 


[aw dy! fAx's Xo) fE(y's Xo)(x — x')O(y' — x) 


= AX; Xo) fE(%; Xo) = | fxs Xo) |? = F(x; Xo). (10.7) 
Here we have used (8.19) Lor (8.23) if dx is symbolic for du(x)]. 
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Symbolically we may write, in analogy to (10.6), <x|W|x> = Tr(A, W) 
with A, = |x><x|; however, one has to keep in mind that A, is not a projec- 
tion operator onto a subspace of the space of physical states. The physical 
question, i.e., the question that has an answer that can be compared with an 
experimental determination, is: What is the probability of obtaining a value 
x in the interval between x, and x, when a measurement of the observable 
Q is made on the state W? Here the interval (x,, x2) is determined by the 
resolution of the apparatus with which the measurement is made, and again 
cannot be infinitesimally small and exactly determined. 

The operator which represents finding the position in the interval x, < 
x < xX, is the projection operator 


heer) = | eG: (10.8) 


the probability of finding the position anywhere in x, < x < x, when the 
system is in the state W of (10.5) is then 


Tr(A(x,, X2)W) = fax <x'|A(x,, X2)W|x’> = { dx <x|W|x> 


= { Ax feel | dx F (x3 Xo). (10.9) 


The function F(x; Xo) = | f(x; X)|? is determined by the experimental 
situation. It describes the resolution of the apparatus by which the state W 
is prepared. The better the resolution, the smaller is €, and in the idealized 
but unphysical limit € — 0, 


| f(x3 Xo)|? = Fx; Xo) > O(x — Xo). (10.10) 


The limit (10.10) would correspond to an apparatus with infinite resolution, 
which cannot exist in reality. Thus the state W in (10.5) can be only as “pure” 
as the experimental resolution allows, and that can never be “ ideally pure” 
as described by a 6-function. 

In order that the state W given in (10.5) be normalized, f,(x, xo) must 
fulfill 


+ © + 00 + 0 
TrWw= [dx <xiWixy | di | fs. xo)? = J dx Fl. x0) = 1 


(10.11) 
Examples for F (x) = F (x, 0) are (function sequences of 6-type): 
F(x)=+ << (Figure 10.1), (10.12) 
: nm x? + é? 
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Jat | 
=-—— -_--—= Figure 10.2), 10.13) 
FF (x) = Jn € exp/ 2 2 ( & ) ( 
Oni x = 
1 —_— — 
———_ 
1S. € € ‘ 
F(x)=%7- if- 5 <x<+ 5 (Figure 10.3), (10.14) 
€ 
O if + ee 
1 = 
5 x 
{ 0 ) Ailes Se 
ae ee xe 
G 
F(x) = (Figure 10.4). (10.15) 
=r. 
= 5 Oe cere 
é 
0 if+e<x 


All these functions have their main support in the interval —¢ < x < +€ 
and have the property 


F(x) (x) ase 0. 


Thus we have arrived at the basic assumption IIIa for continuous spec- 
trum: The statistical operator W given by (10.5) with a distribution function 
F(x) as narrow as experimentally possible is the continuous-spectrum 
analog of a pure eigenstate for an observable with a discrete spectrum. 

We shall call such a state an “almost pure” state and an “almost eigen- 
state” of the observable with continuous spectrum. 

An arbitrary state, in whose preparation no attempt at “purity” may have 
been made, is described by (10.4). F(x’. x”) is connected with the probability 
distribution function of the observable Q by 


TrA,W = <x|W\|x> = fav ax Fixx )exlx <x |x = ee): 
(10.16) 


This is the appropriate place to mention the interrelationship 
between the physical measurability and preparability of a state and 
the mathematical description of quantum physics in a rigged 
Hilbert space ® c # < ®”, though we cannot give a detailed 
description here.'” 


'” An elementary discussion of this subject is given in A. Bohm, The Rigged Hilbert Space 
and Quantum Mechanics, Springer Lecture Notes in Physics, vol. 78 (1978). 
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Figure 10.1 Graph of F(x) = (1/n) €/(x? + €?) for two values of e. 


Figure 10.2. Graph of F(x) = (1/,/22)(1/€) exp(—3x?/e?). 


A 


é 


Figure 10.3 Graph of F(x) in Equation (10.14) 


Figure 10.4 Graph of F(x) in Equation (10.15). 
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The resolution function F(x; Xo) or f(x; Xo) that corresponds 
to a realistic experimental apparatus must be a well-behaved func- 
tion; e.g., it can be something like that given in (10.13)-(10.16) with 
a small but finite € for the description of a state as “pure” as the 
preparation allows. One can make this precise by requiring that 
f(x; Xo) be an element of the Schwartz space, which is a realization 
of ®. More appropriate still may be to require that F(x; xo) have 
compact support. Those f(x; x9)’s that are either distributions or 
elements of the Hilbert space (i.e, Lebesque square-integrable 
functions that are determined only up to their values on a set of 
measure zero) cannot describe the resolution of a realistic apparatus 
(e.g., the detector efficiency). 

If f,(x, Xo) is an element of the Schwartz space, then the state one 
obtains by (10.5) or (10.4) is in the space ® and not in either # or 
®*. More precisely, W is a positive definite Hilbert-space-bounded 
trace-class operator with WH < ®. If W is a pure state, W = 
|f><f|, then f(x) = <x|f> given by (10.3) is an element of the 
Schwartz space, and | f> is an element of ®. Thus the requirement 
that the space of physical states consist of only those states that can 
be prepared in a realistic experiment means that in a precise formula- 
tion of the preceding basic assumptions the space of physical states 
should not be the Hilbert space #, but the nuclear topological 
space ® of the rigged Hilbert space ® <c # < ©”. Though the 
elements of ®* which are not in ® do not represent physical states, 
@®”* is of immense value, as it contains the generalized eigenvectors of 
self-adjoint operators with continuous spectra, which are not 
elements of #; cf. the mathematical note [| : || in Section IL8. 

The Hilbert space # is infinite-dimensional, and it is infinite- 
dimensional in a very particular sense; namely, it is complete with 
respect to a particular topology, that is, with respect to a particular 
meaning of convergence of infinite sequences. Since an infinite 
number of states can never be prepared, physical measurements 
cannot tell us anything about infinite sequences, but can at most 
give us information about arbitrarily large but finite sequences. 
Therefore, physics cannot give us sufficient information to tell how 
to take the limit to infinity, 1.e., how to choose the topology. The 
choice of the topology will be a mathematical generalization, but 
then one should choose the topology such that it is most con- 
venient. 

With the choice of ® as the space of physical states, one does not 
only obtain a description which is closer to reality for the above- 
mentioned reasons connected with measurement and preparation 
of physical states. One also obtains a tremendous simplification of 
the mathematical description: Even the simplest algebra of operators 
that appears in quantum mechanics, the one fulfilling the relation 
(2.1a), cannot be represented by #-continuous operators. Con- 
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sequently there are vectors that lie outside the domain of the defini- 
tion of an observable that is represented by an unbounded operator 
in #, e.g., the vector that would correspond to a state of infinite 
energy. In contrast to this, it seems possible that for all physical 
systems the observables can be represented by ®-continuous 
operators. Thus in the rigged-Hilbert-space description the mathe- 
matical image of a physical system is an algebra of continuous 
operators, and all the mathematical manipulations that physicists 
perform can be rigorously justified. 


After having considered the continuous-spectrum analogue (10.5) of 
(4.31) in the axiom IIIa we shall now consider the basic assumption IIIb for 
continuous spectrum. Let the physical system be in a state W; this state 
may have been prepared by measurements of an observable with a con- 
tinuous spectrum or with a discrete spectrum, or may even be completely 
unspecified. If a measurement of an observable Q with continuous spectrum 
—co <x < +00 is made on the state W and is used to select out a sub- 
ensemble with values x of Q lying in the range x; < x < x3, then the (un- 
normalized) state W’ that describes this subensemble is 


i A(x, x, )WA(x,, X2) 
= { dx’ { dx Wes |W |x OMX (10.17) 


This is the continuous spectrum analog of (5.2). Note that for x; > —©, 
X, 7 +00 we have W’ = W, which says that no measurement has been 
performed. Again, x, < x < x, must be a finite-size interval; it cannot be 
infinitesimally small, due to the finite resolution and finite extension of the 
measuring apparatus. 

To show that (10.18) represents a state where measurement of Q must 
yield a value x in the interval x; < x < xz, we compute the probability 
density for getting the value x, which according to (10.7) is given by 


| 1 
oe W' = —— ¢<° Ww’ 
(ALY = py lt As ae et 


_ —- [a I. dx" Ch x!) Cx! | Wx") Cx" |x) 


if le XP Or a XD (10.18) 


Cir jarcx|W\x> ix) = x =< x2. 


An actual position measurement is over some range of values in a small 
interval x9 — € < xX < Xo + €. The probability for measuring Q to be in 
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such an interval is 


<A(Xo — & Xo + ©)” 


xo + € 
= i oe 

0 if (xo — & Xo + ©) (Atk) = 
= xo + € * 

Fa | BGIWID iG — 6x9 FOS (1%) 


(10.19) 


Thus W’ represents a state where the position is in the interval (x, x2). 


11.11 Position and Momentum Measurements—Particles and 
Waves 


The material of Section IJ.10 has provided us with a formalism in which we 
can discuss problems connected with the measurement of position and 
momentum in more detail. Let us return to the classical-mechanical model 
of the harmonic oscillator, which consisted of two mass points vibrat- 
ing around an equilibrium point. For simplicity we assume that m, > m,, 
so we can consider m, to be at rest. The position x and momentum p are then 
essentially the position and momentum of mass point 1. 

How can a measurement of the position be made on the quantum-mechani- 
cal harmonic oscillator? We have considered the diatomic molecule as a 
realization of this system. The question is, then, how do we measure the 
position, i.e., the separation between the two atoms of the CO molecules in 
the collision chamber? There seems to be no way one can think of to set up a 
measurement apparatus that would do so. The same argument also applies 
to a momentum measurement on this ensemble. Thus the above discussion 
in Section 8 of the probabilities w(x) and w(p) is not of much practical value 
if one restricts oneself to the quantum-mechanical harmonic-oscillator 
system. With the formalism developed in Section IJ.10 we can give a theor- 
etical explanation of this situation. 

According to (10.17), after a measurement of the position operator Q, 
the physical system is in an almost eigenstate of the position operator; 1.e., 
all molecules in the ensemble have a position localized in the interval 
(x — €,x + ©). For our classical model of the CO molecule as two oscillating 
mass points (with m, > m, for simplicity), the above statement means that 
m, is located at a distance X from m, that lies in the interval (x — ¢, x + €). 
However, a classical system for which m, is located in a restricted domain 
of space is not an oscillator. It is a localized mass point, which we call a 
particle. A measurement of the position operator Q on the harmonic oscil- 
lator would destroy the oscillator and result in a physical system that has as 
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its classical image a particle. Thus the fact that it is impossible in practice 
to set up an experiment that would measure the observable Q on the en- 
semble of CO molecules in the collision chamber has a theoretical descrip- 
tion in the basic assumption IIIb as described by (10.18). 

The above arguments also exemplify the fact that for a particular physical 
system, such as the vibrating CO molecule, there are different kinds of ob- 
servables. Of one kind (e.g., the energy operator of the vibrating CO mole- 
cule) one can easily prepare eigenstates of the particular physical system. 
Of the other kind (e.g., the relative-position operator of the vibrating CO 
molecule) the preparation of eigenstates will destroy the system. 

The measurement of this second kind of observable can only be described 
if one enlarges the physical system. Thus in our example we need not restrict 
ourselves to CO molecules, but can consider the larger physical system that 
consists of CO molecules, O atoms, and C atoms. Then the CO molecules 
are just particular states of this larger physical system. Other states are the 
states in which the C and O atoms are not bound. In this larger system, the 
measurement of the position (the distance between C and O atoms) is always 
possible at least in principle. 

For the almost pure eigenstate of the position operator Q, we have 


W(x) = [Jax dx" Pix’ — x)~dx" — x)|x><x" |, (11.1x) 


where Q|x> = x|x> and |A(x' — x)|? has the property of the functions 
(10.12)-(10.15). One can define the almost pure eigenstate of the momentum 
operator P, 


W(p) = ie dp’ [ dp" f(p' — p)k(p" — p)|p"><p"|, (A.A) 


where P|p> = p|p> and | fp’ — p)|* is again a function of the kind given 
in (10.12)-(10.15). Q and P here are related by (2.1a). In such a state the 
measurement of the momentum P gives a value p in the interval p — € < 
p <p + «€. By the same arguments as above, such a state cannot be realized 
by the oscillator; therefore we do not want to require (2.1b), and shall just 
require that Q and P be related by (2.1a) and that as a consequence!® of 
this <p|x) is given by (8.56). A possible relation between the energy operator 
H and the P and Q that facilitates the preparation of almost momentum 
eigenstates is H = P?/2m, where mis a system constant, the mass. A quantum 
physical system obeying this relation is called a free nonrelativistic elementary 
particle. 

A quantum-mechanical system with well-defined momentum has as its 
classical image a wave. 

Let us discuss this last point a bit more. The characteristic or defining 
property of a particle is that it is localized in space, i.e., it can be assigned a 


18 As remarked in | J at the end of section 8, (2.1) is not sufficient to derive (8.56). One has 
to make an additional assumption, e.g,, that P? + Q? is essentially selfadjoint. 
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definite position. In the same way, the characteristic or defining property 
of a wave, in particular a plane wave, is that it has infinite periodic spatial 
extension. The mathematical description of a standing plane wave is 


@(x, t) = Ae™*e7*, (11.2) 
so the periodic spatial extension is described by 
ue Aes d13) 


where A is the amplitude and k is the wave number. The fundamental period 
or wavelength is A = 27/k. If the wave is not a plane wave, i.e., does not have 
a well-defined wave number or wavelength but has wave numbers k’ inside 
a certain interval (k — Ak, k + Ak), then one has a wave packet described 
by 


P(x) = [ “dk A(k’)ei*™, (11.4) 


where A(k’) is nonzero only in the interval 2 Ak about the value k. 

To see that a quantum-mechanical system with well-defined momentum 
has as its classical image a wave, let us calculate the spatial distribution of a 
state with well-defined momentum. Let us first do this simple-mindedly, 
working with “states” having exact momentum |p> and “states” having 
exact position |x > (which are “described” by W(p) = dp|p><p| and W(x) = 
dx|x><«x|, respectively), though we should keep in mind that such “states” 
are not physical states. 

For a pure state A, with “well-behaved” state vector wy, the spatial 
distribution is given by the wave function w(x) = <x|W>; according 
to (6.25) the probability density for measuring the value x in a position 
measurement of the state w is w,(x) = |<x|>|?. Now let w be the “not 
well-behaved” and therefore unphysical “state” vector |p>. The spatial 
distribution is then given by <x|p>, which is, according to (8.57), 


<x|p> = 


—— ef *PIh (11.5) 
/ anh 
If we compare this with the spatial distribution of the plane wave given by 
(11.3), we see that an “exact momentum eigenstate ” of a quantum-mechanical 
system has the same spatial distribution as the classical system called a plane 
wave, with the wave number and wavelength given by the de Broglie relation: 
2mh h 


1 
k=-p and A=— 


; anit (11.6a) 


This is the justification for calling an ensemble of quantum-mechanical 
systems with definite momentum p a wave with wavelength A = h/p. 

Let us now return to the physical case where the state is given by (11.1p) 
and is an almost momentum eigenstate with as narrow a momentum dis- 
tribution | f(p’ — p)|* as possible. The probability density for obtaining 
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the value x in a measurement of the observable Q in the state W(p) is given 
according to (10.17) by 


<x|WOpIx> = | de’ | do” £0" — pdRlo" — vX<xIP'><P"lw> 


eo ixp’/A 


enh 
In order to compare this with the wave function for a (classical) wave packet 
(11.4), we define 


+ 00 ; } eixp'/h + 00 
= | ar 50) | ae lee’ - (11.7) 


$6 ) im eixP'/f 
x)= d ' : Hess ae 
ae FP’ — p) Tinh 
+ 00 Oa oe 
- dk'h i) ee (11.8) 


—o© / 27h 


h 
A(k) = ft fA — be 


$(x) indeed describes a wave packet (11.4), and the intensity of this wave 
gives, according to 


where p’ = hk’. With 


<x|W(p)x> = 16)’, (11.9) 


the probability density for the value x. 

Thus a quantum-mechanical system in a state with well-defined momen- 
tum (in a narrow momentum interval) has as its classical image a wave 
packet with wavelength 4 in a narrow interval: 


2nh 
Tl ey. 2nh 


; 11.10 
Daze Nap E ( ) 


The intensity of this wave corresponds to the probability density for the 
measurement of the position. ¢(x) describes the probability and is called 
the probability amplitude. 

We now understand why W(x) = <x|> has been called a wave function: 
Let W, be a pure state, i.e., W, be a projector onto the space spanned by w. 
Then the probability density for obtaining the value x in a measurement 
of the position is 


W(x) = <x|Wylx> = <xlYoC lx? 
= W(x) Wx) = [WO /. (11.11) 
On the other hand, we can calculate <x|W,|x> by inserting a complete 


system of generalized states 


7 [ap ip’><o' 
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to obtain 


Ox |Wylx) = | dp’ dp"<x\p'><p Wb lp"><P'IxD 


= { i ae i ip’ Wp"). (ALA) 
./2an ./ 2th , , 


If we compare (11.11) and (11.12), we see that the wave function, whose 
physical interpretation was that its absolute value squared is the probability 
distribution, is given by 


eixP Uh 


WO) = jo dp’ — Wp) (11.13) 


with w(p’) = <p'|>. Thus the wave function W(x) may be considered as a 
superposition of plane waves, w(x) comes closer to being a plane wave as 
the transition coefficient <p'| > between the pure state W and the generalized 
momentum eigenvector | p’> comes closer to being a 6-function d(p’ — p). 

The wave character of the quantum system can be experimentally dis- 
played by diffraction experiments. Let us consider a system that has been 
prepared in an almost eigenstate of momentum, i.e., a monochromatic beam 
of quantum particles. Such a monochromatic beam is easy to prepare if the 
particles are charged, but harder to prepare if they are neutral (like atoms 
or molecules). Let us therefore consider first a monochromatic beam of 
electrons, such as that prepared for the electron-loss experiment that we 
considered before. The electron beam is produced by a filament, then ac- 
celerated by an electric potential, and then passed through crossed electric 
and magnetic fields of very well-defined field strengths. Only electrons with 
very well-defined momenta are not deflected. If f,(p’ — p) describes the 
momentum spread of This beam, then the spatial distribution is given by 
(11.8): 


G(x) = [avs Jak = SAE = 9) ree (11.14) 


/2n 2nh 
which becomes closer to being a plane wave, 
P(x) = Ae*, 


as the momentum spread about the value hk becomes narrower. This ap- 
proximate plane wave may be diffracted by a grid, and one obtains a typical 
diffraction pattern. 

Related to Equation (11.6a) by special relativity 1s another equation: 


1 
o=-E, (11.6b) 
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where w is the circular frequency, w = 2zv. At this point, where we do not 
yet want to consider quantum-mechanical time development, the relation 
(11.6b) is best obtained from the dispersion law for our waves, w = f(k), 
and the relation between energy and momentum, E = f(p). Thus, for the 
system “electromagnetic wave-photon” the dispersion law is w = ck (v = ¢/A), 
and the relation between energy and momentum is E = cp [E? — (cp)? = 
(mc)? with rest mass m = 0]. From these classical relations (the first for a 
wave, the second for a massless relativistic particle) it follows immediately 
that (11.6b) is a consequence of (11.6a) and v.ce versa. For the system 
“electron de Broglie wave-nonrelativistic electron” the dispersion law is 
w = k?/2(m/h) and the relation between energy and momentum is E = 
p’/2m, which again shows that (11.6a) and (11.6b) are consequences of each 
other. 


The two physical systems “electromagnetic wave-photon” and 
“electron de Broglie wave-electron™ are two examples of physical 
systems called elementary particles.'? The historical development of 
the ideas about these two systems was, however, quite distinct: To 
begin with, all known experimental facts about light could be 
explained by a particle theory (Newton, 1663) as well as a wave 
theory (C. Huygens, 1678). The same domain of physical knowledge 
could be described by two different pictures. Then, diffraction of light 
was discovered (Young, 1801), which could only be explained by 
the wave picture, not the particle picture. The cathode rays, on the 
other hand, though considered by some to be, like light, due to a 
process in the ether, were soon proven to behave like negatively 
charged particles in an electric and magnetic field (J. J. Thomson, 
1897: J. Perrin 1897). Thus light was a wave and electrons were 
particles. Had the photoeffect (P. Lenard, 1902) been discovered 
before the diffraction phenomena of light, and had the electron 
diffraction (Davisson and Germer, G. P. Thomson, 1927) been the 
first effect observed with the cathode rays, the situation might have 
been just the reverse. 

As light was considered a wave, one had the dispersion law 
w = ck, or w = c/A, and Einstein (1905) showed that in order to 
explain the photoeffect one has to associate with light, besides the 
energy given by Planck’s (1900) formula E = hw, the momentum 
p= Esc. 

Electrons, on the other hand, were considered particles, and one 
had the energy-momentum relation E = p?/2m. But de Broglie 
(1924) postulated that they are waves, with a wave number given by 


19 1t would have been better to coin a new name for these physical systems (e.g., “quanta”) 
to avoid any reference to Newtonian particles. 
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(11.6a). Then he derived the dispersion law for these waves from the 
Planck formula (11.6b) to be w = k?/2(m/h). 


The two relations 
E=ho p = hk (11.6) 


are justly considered the gate to quantum theory. We derived at least the 
second equation (11.6a) from the defining relation (2.1a). But historically 
they were the starting point of-the long conjecturing process that led to 
(2.1a) and ultimately to quantum mechanics in general. The relations (11.6) 
connect two different classical pictures (often called dual). 

The one picture is a particle, ie., a localizable mass point. Classically, 
such a particle can be given definite values of energy and momentum (e.g., 
the momentum and the energy measured by an apparatus that moves with a 
velocity —v relative to such a localized mass point are p = mv and E = 
p?/2m). 

The other picture is a wave, i.e., a more or less periodic disturbance of 
infinite spatial extension. A wave has a frequency w and a wave vector k. 

In quantum mechanics the physical system (quantum, elementary par- 
ticle), of which we should always imagine an ensemble, can occur in different 
states. For instance, it can be in an almost momentum eigenstate described 
by (11.1p). We may then ask about the position of such a state, e.g., we may 
want to know the probability of finding the position anywhere in an interval 
x — A/2 < x’ <x + A/2. This is according to the basic assumption II, 
expressed with the projection operator of (10.8) as: 


\ xt+A/2 
Tr (1 (x - = x+ 5) Tm) = i dx <x |W <>. 
A/ 


Safa |?) 


For the sake of definiteness we may assume that the almost momentum 
eigenstate is given by (11.1p) with F,(p’ — p) given by (10.14) or Figure 10.3, 
so that f,(p’ — p) = 1/,/¢ in the interval between p — ¢/2 and p + ¢/2. The 
probability density of the position operator is then obtained using (11.7) as 


pte/2 1 erin pt+e/2 1 e ixp'sh 
Cx|WOpIxy = fd’ [apt 
p-e/2 wen /2nh /p—e/2 J € / 2nh 


1 
— / 
oe 4 € for ex/h < 1. 
This means that the probability density of the position operator is indepen- 
dent of the position. Thus the probability of finding for the physical system 
a value in the interval of length A around x is 


x+A/2 1 
{ ae AW) ee 
2n 


x—A/2 h 
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i.e., the system is located with the same small probability anywhere in space. 

In the limiting case of exact momentum when the momentum spread 
€ > 0, the probability of finding the physical system in an interval A any- 
where would be zero. However, this limiting case is unphysical, because 
there is no apparatus that could prepare such a state. For example, an almost 
momentum eigenstate of electrons, a monochromatic electron beam, can 
be prepared by the apparatus used in the energy-loss experiment of Section 
II.4. The electrons produced by a filament are accelerated by an electrical 
potential. Even if this potential is very well detined, the electrons will have 
still the same momentum spread with which they were emitted by the fila- 
ment. This electron beam then passes through a monochromator (an electric 
or magnetic field, or in some cases crossed electric and magnetic fields), 
and only the electrons with a definite velocity or momentum will arrive 
at the position of the slit. But again, even if the field is very well defined, this 
slit has a finite opening, so that electrons with slightly different momenta 
can pass through. In general, because of the necessarily finite size of the 
macroscopic measurement apparatus, the preparation of a monochromatic 
beam with zero spread is impossible. 

As we have seen above, for an almost momentum eigenstate every position 
has the same probability, ie., the position is undetermined. The physical 
system is in a wave state with a rather well-defined wave number and wave- 
length; thus it appears as a wave. 

Thus, according to the formulatiort of quantum mechanics based upon the 
basic assumptions I, II, and III, a quantum physical system in a momentum 
eigenstate behaves like a wave—it “is” a wave —until a measurement is 
performed that alters the state (e.g., a position measurement). As long as 
such a measurement is not performed, the theoretical predictions are exactly 
the same as in a wave theory. 

We can go through the same arguments starting with an almost position 
eigenstate W(x) described by (11.1x). The probability density of the momen- 
tum operator is then obtained as 


1 
(p|W(x)|P> = 55 & 


and the probability of finding the momentum in an interval between p — ¢/2 
and p + é/2 is 


pte/2 1 
{ a \Weoles = =~ ee. 


poe 2mh 


As this probability does not depend upon the central value p, any value 
of p has the same probability. For an almost position eigenstate the mo- 
mentum is undetermined. Such an almost position eigenstate we have called 
a particle, but we see now that, in distinction to a classical (Newtonian) 
particle, the quantum physical system in a particle state cannot have a 
definite value of momentum. 
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The constant h was introduced in the defining relation (2.1a) without 
much discussion. With the consequences of (2.1a) derived here, we can now 
attempt an explanation of its physical meaning. 

One can prepare an almost momentum eigenstate, e.g., by the macroscopic 
apparatus of Section II.4. From the electric and magnetic fields of the ap- 
paratus one obtains the velocity v and the momentum p = mv. According 
to (11.6a) such a state appears as a wave with inverse wavelength k = (1/h)mv. 
The wave character can be displayed experimentally, and the wavelength 
can be measured in a diffraction,experiment. For electrons 


(m= 9 x 10 7:9), 


it turns out that grids with the right grid constant are crystals, whose typical 
separation of adjacent atoms is of the order of 1 A = 107?° m. In the dif- 
fraction experiment 1 = 1/k is measured in the units of length (cm). The 
traditional units of momentum are erg sec/cm; if the energy is measured in 
eV,?° then the units of momentum are eV sec/cm. The de Broglie relation 
(11.6a) states that the inverse wavelength k of the probability waves and the 
momentum p are the same quantity measured in different units. 

The Planck constant fh may then be obtained as the ratio of the value of a 
momentum in erg sec/cm or eV sec/cm to its value in cm” '. The result is 


h = 6.63 x 10°27 erg sec, 
h = 1.0546 x 10-28 erg sec, 
or 
h =x6.58 «x 107 SV see, 


It could, of course, be that # has a different value for different physical 
systems, as was the case for the system constants « and @ in Section II.2 and 
for m in the present section. It turns out that in fact h in the relation (2.1a) 
has the same value for all physical systems for which (2.1a) holds. Therefore 
one can change the units of either P or Q in (2.1a) so that in the new units 
one has [P, Q] = —il. For example, one can decide to keep cm as the unit 
of position; then P is measured in cm~'. Or, as one does in relativistic 
particle physics, one can choose for the unit of momentum 1 eV/c, where c 
is the velocity of light, another constant of nature. Then the unit of position 
is (1 eV/c)~!. h, like any constant of nature, is just a conversion factor for 
the values of one and the same physical quantity in different units. Before 
this constant of nature is discovered, the values of this same quantity in 
different units (e.g., the momentum p and the wave vector k) are thought 
to be different physical quantities. After the constant has been discovered 
and the different physical quantities have been related by a new theory, one 
can measure these quantities in the same units and eliminate in this way 
one of the previously independent units. 


201eV = 1.6 x 10°! erg = 1.6 x 107 '* dyn cm; 1 dyn = ig cm/sec. 
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Problems 


1. Show that the operators a and a‘, defined by the equations (3.1), are the adjoints 
of each other. 

2. Prove the following statements: 

(a) All eigenvalues of a Hermitian operator are real. 

(b) Let f be an eigenvector of A corresponding to the eigenvalue A. Show that for 
each nonzero «€ C the vector af is also an eigenvector of A corresponding to the 
eigenvalue af. 

(c) Let f, and f, be eigenvectors of A, with respective eigenvalues 2, and A,. Show 
that if A, # A,, then f, and f, are orthogonal. 

3. Show that the collection of all linear operators on a (complex) linear space R 
constitutes an associative algebra with unit element under the operations of scalar 
multiplication of an operator, addition of operators, and multiplication (composition) 
of operators. 

4. Show that the Cauchy-Schwarz Bunyakovski inequality [Equation (I.2.7)] 
follows from the definition (1.2.5) of a positive Hermitian form. 

5. For the quantum-mechanical harmonic oscillator, calculate the diagonal 
matrix elements of the energy operator H between the states 


bn = en, 
where @,, is the eigenvector belonging to the eigenvalue n of the operator N = ata, 
according to Equation (3.20). 
6. Let @, be the vectors of Equation (3.19). Define for each complex number z 
the coherent state vector 


Ze 


Oo 
fon fey 
Bc aes dn. 
n=0 


n!} 
(a) Show that the coherent state |z) is an eigenstate of the operator a with eigenvalue z. 
(b) Show that the scalar product of ¢, with |z> has the property 


iD e oe 


nl 
(c} Show that the scalar product <=|2’> of two coherent states |=> and |z’> is given by 
<z|z’> = exp(—$|z|? + Zz’ — 41z'/”). 


(d) Calculate the expectation value of the operators H, Q, and P in the coherent state 
|z>, ie., calculate (z|H|z>, <z|Q|z>, and <z|P|z). 

7. Figure PS.la shows a schematic diagram of a slightly different experimental 
setup than the one of Figure 4.2a. Results of an energy-loss experiment performed on 
N, with this apparatus are shown in Figure PS.1b. The bump at v = 0 in Figure PS.Ib 
is approximately 30 times larger than the bumps at r = 1, 2,3....; this ts in contrast 
to the situation depicted in Figure 4.2b, where the vr = 0 bump is approximately 3 times 
larger than the succeeding bumps. Explain the difference. Determine from Figure 
PS. 1b the statistical operator W that describes the state of the ensemble of N, molecules 
of the experiment of Figure PS. 1a. 


8. Suppose we have a quantum-mechanical oscillator. 
(a) Let the state of the system be given by (4.17). Show that (4.21) follows from (4.20). 
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Figure PS.1 (a) Schematic diagram of the double electrostatic 
analyzer. Electrons are emitted by the filament, deflected by the 
cylindrical grids (4A and 4B) with radii of 1.0 and 1.5 cm, respectively, 
injected into the collision chamber 5, and analyzed by sweeping the 
voltage between electrodes 6 and 3. S,, S,, 83,54, S, are shields to 
collect unwanted electrons; 4C and 7C are top and bottom grids. 
Typical operating voltages between the electrodes indicated: 

(4A-4B) = 1.2 V; (7A-B) = 1.2 V; (Fil-3) = 1.4 V; (F-S,) = 20 V; 
(F-S,) = (F-S3) = 20 V. The electron collector is grounded. All slits 
are 0.5 x 4mm. (b) Energy spectrum of forward-scattered electrons at 
an incident electron energy of 2.6 eV. [From Schulz, G. J: Physical 
Review /25, 229 (1962), with permission. ] 
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(b) Show that W = W/(Tr W) is the normalized statistical operator, where W is 

given by (4.22). 

(c) Let A bea projection operator. Show that Tr A = dim(A#). 

9. Calculate the expectation values of the operators Q, Q?, P, and P? when the 
quantum-mechanical oscillator is in the state W = A,, where A, is the projection 
operator onto the energy eigenspace corresponding to the eigenvalue E, = hw(n + 4). 

10. Let A be the projector onto the one-dimensional subspace spanned by 
W (\lW|| = 1). Show that the dispersion disp,,, H of H when the system is in the state 
W = A satisfies 


disp,,) H = 0. 


11. Calculate dispyy,H for the state of the CO molecules in the collision chamber 
of the energy-loss experiment described in Section II.4. Show that dispyy, H > 0. 

12. Let A be the projection operator onto an n-dimensional eigenspace of the 
operator A. Show that disp,,, A = 0. 

13. Let W be given by (4.31). Calculate <A> and show that this is the result we 
would expect if <A) is to represent the average value of A. Is W’ as given by (5.1) normal- 
ized, i.e., does Tr W’ = 1? 

14. Let H and B be two observables with spectra E,, and b;, respectively. Let A, 
and A, be the projection operators on the eigenspaces of H with eigenvalue E, and of B 
with eigenvalue b;, respectively. Let the quantum-mechanical system be in a state in 
which an energy measurement has always resulted in the value E,,. 

(a) What is the statistical operator W for this state? 

(b) Inthis state W a measurement of the observable B is performed. What is the statis- 
tical operator W’ after the B measurement, without selection of subsystems? 

(c) What is the probability of finding the value b; in a measurement of B on the 
physical system in the state W? in the state W'? Compare these two probabilities. 

(d) The observable H is now measured again. What is the probability of obtaining E,, 
in this measurement of H? Compare this probability with the probability of 

obtaining E,, in the original state W. 

(e) Let A bea third observable with eigenvalues a; and projectors A,, onto the eigen- 
spaces. Calculate the probabilities of obtaining the value a, in a measurement of 

A when the system is in the state W and when the system is in the state W’. When 

are these probabilities equal? 

15. Calculate the matrix elements (n|P|m> and <n|Q|m) of the operators P 
and Q between the energy eigenvectors $, = |n> of the harmonic oscillator. 


16. The Hermite polynomials are defined by 
4 d" —y2 
H,(y) = (- 1" — (e-”). 
dy 
(a) Show that the H,(y) fulfill the recurrence relations 
H,(y) = Pas (ie i(y) _ 2(n oe Dis a(y), 


dH,(y) 
dy 


= Fel) 


(b) Show that H,(y) satisfies the differential equation 
Hi(y) — 2yH,(y) + 2nH,(y) = 9. 
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(c) Give explicit expressions for the first six Hermite polynomials 
Hily) (n= 01, 2.33455): 


(d) Show that the Hermite polynomials fulfill the following orthogonality relation: 


+00 


dy eH, y)H,(y) = 2'n! / 15 an: 


17. In the energy eigenstate W = A, corresponding to the energy eigenvalue 
E, = thw, the probability w,(x) Ax of measuring the position to be within an interval 


(x — Ax/2, x + Ax/2) about the value x is nonzero for x > a = ./2E,/(uw") (cf. 
Figure 9.2). From this it follows that the potential energy U may be larger than the total 
energy E, because 


U = 4yw?x? > 4ym?a? = E,. 


Point out what is wrong with this argument, and give a correct interpretation of the 
situation. 


18. Show that the operators P'S) = (h/i) d/dx and Q“ = x, considered as operators 
on the wave function, satisfy the Heisenberg commutation relation. 


19. It was derived in Section II.8 that, on the wave function, the momentum 
operator P and the position operator Q are represented by the operators P™! = 
(h/i) d/dx and Q®) = x. Which assumption besides the Heisenberg commutation rela- 
tion has been used in this derivation? 


20. Let a quantum-mechanical particle with mass m be confined by impenetrable 
walls to the region —ua < x < +a (a one-dimensional rectangular well with infinitely 
high walls). This means that the expectation value of the potential-energy operator }’ 
between the generalized position eigenstates |x) is 

WV), = x|VIx> 
eS 
where ¢ is a constant; and that the probability that a measurement of the position Q 
will give the value x is nonzero only for ~a < x < +a. Denote by |) the eigenvectors 
of the energy operator 


ys 


H=—+1 
2m 
and let A,, denote the projector onto the space spanned by |). Calculate the eigen- 
values of H, and compute the expectation values of Q, Q*, P, and P? when the system is 
in an energy eigenstate A,,. 


21. Show that for any linear operators A, B and for Ae C, the trace as defined 
by Equation (4.3) has the following properties: 
(a) It is independent of the choice of basis. 
(b) Tr(AA) = A Tr(A). 
(c) Tr(A + B) = Tr(A) + Tr(B). 
(d) Tr(AB) = Tr(BA). 

22. Let W(p) be an almost momentum eigenstate described by the statistical 
operator 


w= | dp | dp’ Ko — Dst0" - DIp>rer"l 
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with 


a 
IAP 2 aa pe 


(Lorentzian momentum distribution). Calculate the probability density of the position 
operator and the probability of finding the position in the interval x» — Ax/2 < x < 
X9 + Ax/2. Discuss the result. 


23. Davisson and Germer scattered low-energy electrons from metal targets. For 
45-eV electrons incident normally on a crystal face, compute the angle between the 
incident beam and the scattering maximum if the metal is assumed to be of simple 
cubic structure with a lattice constant of 3.52 A. 


24. Has it been derived that electrons are waves with wavelength A = h/p? If so, 
then explain how this has been done. If not, explain what has been derived. 


CHAPTER III 


Energy Spectra of Some 
Molecules 


Section III.1 discusses how the energy levels of a quantum-mechanical 
system emitting dipole radiation are observed. A derivation of the transition 
probability is not given. In Section III.2 the defining relations of angular 
momentum are established. Section IJI.3 derives the representations of the 
algebra of angular momentum. In Section III.4 the energy spectrum of a 
rotator is derived and compared with the experimental spectrum of diatomic 
molecules. Section III.5 contains the basic assumption about the physical 
combination of two honidentical quantum-mechanical systems and the 
application of this assumption to the description of vibrating and rotating 
diatomic molecules. 


III.1 Transitions between Energy Levels of Vibrating 
Molecules—The Limitations of the Oscillator Model 


A quantum-mechanical system in a certain stationary state [e.g., the energy 
eigenstate A, of the diatomic molecule (oscillator)] will remain in that state 
so long as it is not acted upon by outside forces. In practice, any quantum- 
mechanical system is acted upon by weak external forces, such as external 
electromagnetic fields or internal electromagnetic fields that arise from the 
motion of charges within the system. Under the influence of such forces, the 
state is liable to change. If the system has a discrete set of states (e.g., the 
energy eigenstates of the oscillator), then a weak external disturbance does 
84 
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not change these states (or, more precisely, it changes the energy levels by a 
negligibly small amount), but the system may jump from one state to another. 

The theory of such transitions, which can be developed as a consequence of 
the basic assumptions of quantum mechanics, will be presented later in 
Chapters XIV and XXI. For the moment we shall just give some semi- 
classical arguments and state the result, which we shall use to obtain transi- 
tion frequencies and selection rules. 

We accept here as an empirical fact that under the influence of ever- 
present external disturbances, the quantum system may perform transitions 
from one energy eigenstate with energy E, to another with energy E,,, and 
emit or absorb the energy difference 


En — Ep 


as electromagnetic radiation in the form of a light quantum or photon of 
frequency 


Vem = A = (1.1) 


If the electromagnetic field has the frequency v,,, then the quantum- 
mechanical system can absorb a photon of this frequency and jump from a 
state of energy E,, to a state of higher energy E,,. On the other hand, if a 
quantum system is in an excited state E, (a state of higher energy than the 
ground state), it can emit a photon of frequency Vj, and drop to a state of 
lower energy E,,. 

Transitions between two states cannot occur under the influence of electro- 
magnetic radiation if the matrix element of the total electric displacement 
operator D of the system vanishes between these two states. Also, the proba- 
bility for such a transition, and thus the intensity of the emitted (or absorbed) 
electromagnetic radiation, is proportional to the square of the modulus of 
this matrix element. 

To illustrate, let us return to our classical picture of the diatomic molecule. 
If the molecule consists of unlike atoms (e.g., CO) then it has an electric dipole 
moment, since the centers of the positive and negative charges do not coincide. 
The dipole moment is the vector directed from the center of negative charges 
to the center of positive charges and is given by 


D = qd, 


where q is the charge and d is the distance between the centers of the charges. 
The permanent dipole moment D, of the molecule lies along the internuclear 
axis. 

Ifthe interatomic (or internuclear) distance changes, the dipole moment will 
change, and to a good approximation it may be assumed that the dipole 
moment is a linear function of the deviation from the equilibrium position of 
the interatomic distance: 


D = Dy + 4x. (1.2) 
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Therefore the dipole moment changes with the frequency of the mechanical 
vibration. Oscillating charges radiate an electromagnetic field, and on the 
basis of classical electrodynamics the emitted light should have a frequency 
equal to the frequency of the oscillator, i.e., 


a) 

ae (13) 
where w = ./k/m is the angular frequency of the classical oscillator [see 
(11.2.8) ]. 

If the molecule consists of two like atoms (e.g., O,, N>), then the dipole 
moment is zero, because the centers of positive and negative charge coincide 
and oscillations of the molecule about its equilibrium position do not lead to 
emission or absorption of electromagnetic radiation. 


Note: The above considerations apply only to dipole radiation; 
there may be quadrupole or higher multipole radiation even for 
molecules for which the dipole moment is zero. However, the 
magnitude of the higher multipole radiation is exceedingly small. 


Let us now turn to the quantum-mechanical molecule. Quantum- 
theoretically, the emission of radiation takes place as a result of a transition 
of the oscillator from a higher to a lower state, and absorption takes place by 
the reverse process. The frequency of the emitted light is given by 


y = bn = Em 


The intensity of the emission, classically proportional to the time-averaged 
value (over one period) of the square of the dipole moment D, is in quantum 
theory proportional to the absolute value squared of the transition matrix 
elements 


(1.1) 


<m|Din> = Dn; (1.4) 
where D is the dipole operator, obtained from (1.2) by the usual procedure of 
replacing the number x with the operator Q: 

D= Dy, + qQ (1.5) 


The transition probability (intensity) A,,,, for spontaneous dipole emission! 
in the transition from an energy state with energy value E, to a state with 
energy E,, is given by 


— Z 3 y) 
Aim a 3hc3 Orn ; (1.6) 
where @,,, = (E, — E,,)/h, c is the velocity of light, and 
1 j 3 
Del = 5. sree ALD? SD) ee d An, v>|?. 
Dim = Gama.) 1 = rato) 2, 2, MIP 


(1.7) 


' The probabilities for induced emission and absorption are also proportional to |D,,,|7 and 
to the intensity of the incident radiation. 


Transitions between Energy Levels of Vibrating Molecules 87 


Dis given by (1.5), A,, is the projection operator on the energy eigenspace with 
eigenvalue E,,, and dim(A,, #) is the dimension of this energy eigenspace. 

Since we are now not interested so much in knowing the intensity as we are 
in knowing when this intensity is zero, the precise form of the transition 
probability is not of primary interest to us at the moment. Equation (1.6) can 
be derived using the general formalism developed in Chapter XXI. For the 
special case of the one-dimensional oscillator we replace the dipole and 
transition vectors by the one-dimensional quantities D and Q and (1.7) goes 
over into |<m|D|n)|?. 

For many quantum-mechanical systems, a great majority of the matrix 
elements of D vanishes, so there is a severe limitation on the possibilities for 
transitions. The rules that express this limitation are called selection rules. 
In order to determine which particular transitions can actually occur for 
the harmonic oscillator, we have to calculate the matrix elements 


<m|D\|n> = q<m| Qn). (1.8) 


The matrix element of the position operator in the energy eigenstates has 
already been calculated [see Equation (I1.8.3)] and is given by 


h 
<m|Q|n> = ie (/nim|n — 1>+./n+ 1<m|n+1). (19) 


Thus we see that the transition probability and hence the emission and 
absorption intensity of light are zero except when the quantum numbers n 
and m differ by unity. Thus the selection rule for the harmonic oscillator is 


p= ae (1.10) 


Transitions in the harmonic oscillator are possible only between neighboring 
energy levels. The frequency of light that is emitted (for E,, > E,,) or absorbed 
(for E,, < E,) is given according to (1.1) and (1.10) by 

ton = Fn OT m+ MS. tN) 
Thus quantum-theoretically the frequency of the radiated light is equal to the 
frequency «/2z of the oscillator, and is independent of the energy level n. 
Similar arguments apply for absorption. Thus we have seen that for the 
particular case of the quantum-mechanical harmonic oscillator, the frequency 
of emitted and absorbed light is the same as it would be for the classical 
oscillator. 

If we recall the energy-level diagram for the harmonic oscillator (Figure 
1I.4.3) we can indicate the allowed transitions by vertical lines (see Figure 1.1). 
All these transitions give rise to the same frequency. This is a consequence of 
the equal spacing between energy levels. 

For a diatomic molecule consisting of two like atoms (e.g., O2), the dipole 
moment operator (1.5) is the zero operator, and therefore no transitions 
between different energy levels occur. 

Let us now turn to the comparison of our theoretical results with the 


experimental situation. 
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Figure 1.1 Dipole transitions between the energy levels of the harmonic 
oscillator. 


In order to find out what frequency we should expect, we turn first to the 
energy-loss spectrum of the CO molecules (Figure I1.4.2). From the distance 
between the bumps in the energy-loss spectrum we find that the difference 
between the various energy levels of the vibrating CO molecule is 


AE 0205 eV. (1.12a) 
If we calculate the frequency from this according to (1.11) we find 


_ AE 0.265 eV 


SSS SS ee 13 -1 
~ Anh 2n x 6.58 x 107 !® eVsec Ot ae (1.12b) 


and 


re - = 0.466 x 10-3 cm = 4.66 1 (1.12c) 


(esi0- 4 cn eo nen 


In spectroscopy it is customary to give the frequency not in sec” ' but in 
cm’, i.e., give instead of the frequency v the wave number v/c = 1/A, which 
indicates the number of waves per cm. We shall not introduce a new symbol 
for it but also call it v, and the unit next to the number will then tell us what 
is meant. The frequency in cm‘, or wave number, of the radiation emitted 
by the transition between the vibrational levels of CO is then 


v= 2140 cm™?. (1.12d) 


Thus we expect, from the energy-loss spectrum of CO, that the vibrating CO 
molecules emit or absorb electromagnetic radiation only with the frequency 
given by (1.12), 1.e., we expect one spectral line in the near infrared region.’ 
If we compare this with the absorption or emission spectrum, we find that 


“In order to give a feeling for the orders of magnitude of various quantities involved in 
molecular spectroscopy, we show a table of the regions of the electromagnetic spectrum in 
Figure 1.2. 


Transitions between Energy Levels of Vibrating Molecules 89 


THE ELECTROMAGNETIC SPECTRUM 


Energy Frequency ‘Transitions Radiation Wave Number Wave Length 
-1 - 
E (ev) v(sec™’) Nuclear Magnetic ; v(cm™’) A (cm) 
Resonance Radio 
7 Spin Orientations Waves 
5 x 10-° 12°10? in Magnetic Field 4x 10-7 25 
Electron Spin Micro- 
Resonance waves 
31 x 10-3 15 11.) Molecular (sader) 
alex 10" Sx 10 ie 
Rotations as ae 
5 <0 12 se SOUS 400 DiSexelOm 
Molecular Infrared 
Vibrations Region 
0.5 12 1024 4000 DSc 105: 
1.55 31890 101 12.5 x 10? 8 x 10°5 
Visible 
3.1 Tis) xe 25 x 10° 4x 107° 
Valence Electronic 
Transitions 
6.2 1.5 x 105 Ultra- + 50 x 10° 2x 10-5 
violet 
1240 3 x 10!” 10’ 10m 
Inner Shell 
Electronic 
Transitions 
1.24x 10 3x 10!8 Xerays L s0* 10-° 
Nuclear Gamma 
Transitions Rays 


Figure 1.2 Schematic diagram of the electromagnetic spectrum. Note 
that the scale is nonlinear. Boundaries between regions are generally 
quite arbitrary. 


it is indeed correct. If the absorption spectrum is obtained with a thin layer 
of absorbing gas, one finds only a single, broad, intense absorption line (or 
band) in the near infrared region, with a wavelength around A = 4.66 yp. 
For other diatomic molecules consisting of unlike atoms, one finds the same 
situation; e.g., for HCI this band lies at A = 2.46 py. One also finds that such 
bands do not appear for molecules consisting of like atoms, such as Op, 
N,, H2. 

If the absorption is observed with thicker layers of gas, the intensity of 
absorption of the fundamental band naturally increases, and in addition a 
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Figure 1.3 Coarse structure of the infrared spectrum of HC] (schematic). 
The intensity actually falls off five times faster than indicated by the 
height of the vertical lines. Herzberg [1966] vol. 1. 


second band of similar form appears quite weakly, at approximately half the 
wavelength or double the frequency (wave number). If the thickness of the 
layer is still further increased (up to several meters at atmospheric pressure), 
a third and possibly even a fourth and a fifth band appear whose wavelengths 
are approximately a third, a fourth, and a fifth, respectively, of that of the 
first band; that is to say, their frequencies are three, four, and five times as 
great. Figure 1.3 gives schematically the complete infrared spectrum of HCl. 
In this figure the lengths of the vertical lines that represent the bands give an 
indication of their intensity. However, the actual decrease in intensity is five 
times as fast as is indicated in the drawing. 

The explanation of these additional bands with lower intensity is that the 
diatomic molecule is not quite a harmonic oscillator. In a harmonic oscillator 
the restoring force increases indefinitely with increasing distance from the 
equilibrium point. However, it is clear that in an actual molecule, when the 
atoms are at a great distance from one another, the attractive force is zero. 
Thus the quantum-mechanical harmonic oscillator is only a simplified model 
of the vibrating molecule, and if one wants to describe the finer details of 


[5, 


> wnas 


Figure 1.4 Energy levels and infrared transitions of the anharmonic 
oscillator. The absorption spectrum is given schematically beneath. 
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vibrating molecules, then the anharmonic forces also have to be taken into 
account. The energy levels of the anharmonic oscillator are not equidistant 
like those of the harmonic oscillator, but rather their separation decreases 
slowly with increasing n. 

The energy levels and absorption spectrum for an anharmonic but almost 
harmonic oscillator are shown in Figure 1.4. (For the sake of clarity a faster 
decrease of AE is drawn than is actually found in most observed cases.) The 
selection rule (1.10), 


fife ae |e 


holds only approximately for the anharmonic oscillator and applies only to 
the most intense transitions. But now transitions with n — m = +2, +3,... 
can also appear—though with rapidly decreasing intensity. All these results 
can be calculated using perturbation theory, which we shall introduce in 
Chapter VIII. We describe these facts here to demonstrate that the simple 
soluble quantum-mechanical models like the harmonic oscillator describe 
only the principal structures of a microphysical system in nature, and cannot 
be expected to describe all the details. This is not a deficiency of the harmonic 
oscillator model but a general property of physical theories. Models are only 
idealizations and cannot be expected to reproduce the experimental results up 
to the last digit. The explanation of a new decimal place in an experimental 
number often requires a new model, and sometimes a completely new theory. 

We shall see this presently when we consider the transition frequencies in 
the near infrared region in more detail, as obtained with a spectrometer of 
sufficiently high resolution. The broad spectral lines for the CO molecule 
around v = 2140 cm™! is then resolved into a number of individual narrow 
lines, as shown in Figure 1.5. That is, around v = 2140 cm! one does not 
have a single line but a band, called the vibration-rotation band. As one 
sees from this figure, this band consists of a series of almost equidistant 
lines, with one line missing in the center of the band. Going out from the 
gap, there are two branches, which are called the P branch (towards longer 
wavelengths) and the R branch (towards smaller wavelengths). Figure 
1.6 shows the same effect for the n = 1 line of Figure 1.3 for HCI. 

One would expect such fine structure in the absorption or emission 
spectrum of electromagnetic radiation for the CO molecule if the energy 
levels of the vibrating molecule of Figure 1.1 were split into a series of sub- 
levels as shown in Figure 1.7, which shows only any two neighboring energy 
levels of the energy spectrum of the vibrating molecule as given in Figure 1.1. 

The description of such a splitting lies outside the capability of an oscillator 
model. It can only mean that a state characterized by the quantum number n 
is not a pure state but is in fact a mixture of states with different energies. In 
the oscillator, however, the state characterized by n was a pure state described 
by a projection operator A, on a one-dimensional subspace spanned by 
¢,, namely A, #. The state of the diatomic molecule characterized by the 
quantum number n must have at least as many dimensions as energy levels 
(when number of energy levels equals the dimension then to any energy value 
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Figure 1.6 The fundamental absorption band for HCl under high resolution. (The lines are doubled due 


and C]9’ in the ratio 3:1; 


c 


we do not discuss this effect here.) [From Alpert Keiser and Szymanski (1970), 
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there belongs a one-dimensional subspace or a projector on a one-dimen- 
sional subspace). Consequently the oscillator model alone describes only 
part of the properties of a diatomic molecule, and in order to describe the 
finer details of the spectrum we have to combine the oscillator model with 
another model, which describes these finer details and which reflects ad- 
ditional features of the diatomic molecule that we have so far not taken into 
account. This new model is the rotator model. 

If we return to our classical picture of the CO molecule, considering it as 
two atoms of mass m, and m, at a distance x, then we see that this classical 
object not only can perform vibrations along the x-axis, but also can 
perform rotations in three-dimensional space around its center of mass. As 
long as it is in the vibrational ground state, that is, as long as the energy 
involved is less than 0.26 eV for CO, it is a rigid rotator, 1.e., it can be con- 
sidered as two pointlike masses m,, m2 fastened to the ends of a weightless 
rigid rod of length x. We shall therefore first study the rigid-rotator model by 
itself. This will provide us with a description of the CO states that are charac- 
terized by the quantum number n = 0, and will also approximately describe 
each set of states with a given vibrational quantum number n. Then we shall 
see how these two models are combined to form the vibrating rotator or the 
rotating vibrator. 


IlI.2 The Rigid Rotator 


To conjecture the mathematical image (the algebra of operators) for the 
rotator, we proceed in the same way as for the oscillator: We consider the 
classical rotator, and we replace the three coordinates of momentum p; 
and the three coordinates of position x; in all expressions for the observables 
with operators P; and Q; that fulfill the canonical commutation relations 


(P.O)=2dyf (0.0)=0 [P.PI=%  @D 


—the obvious generalization of (I].2.1a) to the three-dimensiona! case. 
In classical mechanics the energy E of rotation of a rigid body is given by 


= ho’. (2.2) 


Here ~ is the angular velocity of the rotation,* and J is the moment of inertia 
of the system about the axis of rotation. The angular velocity is related to the 
number of rotations per second, v,,, (the rotational frequency) by 


Git 20. (2:3) 


The angular momentum of the system is given by I = I. Introducing this 
into (2.2), the energy may also be expressed by 
i? 
= _. 2.4 
E=57 (2.4) 


3 We use the same symbol «» for the rotational angular velocity that we used for the vibrational 
angular velocity in Chapter II. 
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Figure 2.1 Dumbbell model of a diatomic molecule. 


The moment of inertia about the axis of rotation for the rotator model is 
given by 


2 2 
I= myrj{ + m2r3, 
where 


mM, my 
iy = 3,5 and LPs) == 


= —___—__ —— — (2.5) 
m, +m, m, +m, 


are the distances of the masses m, and m, from the center of mass C, and x 
is the distance between the two mass points (see Figure 2.1). Substitution 
gives 
mym 
j erent hae (2.6) 
my, + mM 
that is, the moment of inertia about the axis of rotation is the same as that of 
a mass point of mass 
mym 
~— (29) 
m, + m, 
at a distance x from the axis; wis called the reduced mass of the molecule. 
Thus, instead of considering the rotation of the rigid rotator, we can 
equally well consider the rotation of a single mass point of mass p with 
coordinates x;, where the vector x = (x,, x2, X3) is the position vector for the 
mass point yw with the center of mass C as the origin. If we denote the 
momentum of the mass point y in this coordinate system by p = (p;, p>. p3)s 
then the angular momentum about the point C is given by 


ee ea (2.8) 


so its components are given by 
le =D. Gin XjPu = CijeXjPe- (2.9) 
Tsk 


In this equation ¢;,,= +1 for (ijk) = (123) and every even permutation 
thereof, ¢€;;, = —1 for (ijk) an odd permutation of (123), and €; jx, = 0 other- 
wise. It is understood from now on that two identical indices in a term indicate 
a summation from | to 3 on those indices unless stated otherwise. 

The operators that represent the corresponding quantum-mechanical 
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observables are then obtained by our general principle of replacing x;, p; 
by Q;, P; fulfilling (2.1). Thus the angular-momentum operator L is given by 


i= xP. 
or 
L; = €ij, Q;Px: (2.10) 
and the energy operator is given in correspondence to (2.4) by 
a ee Se eet (2.11) 
21 PN Cece Py | 


As Q,and P, are Hermitian operators, it follows that L; and H are Hermitian: 
L} = PLOY = ijn PrQj = €ijxQj Px = Li- (2.11a) 


From the Heisenberg commutation relations (2.1) one obtains the com- 
mutation relations of the operators L; that represent the components of 
angular momentum. The calculation goes as follows: 


[L;, Li] = €ijn€tmnlQj Pr, Om Prd 
= €:j¢€tmn(QjLPx, QmPad + (Qj, Qn Prd Pr) 
= Ej Em QALPr, Onl Pn + QnlPrs Pad} 
+O Ola a Onl; P,J}P,) 


h h 
= nti OP, 7 Orne Onlin 7 3, 


h 
mer (€:je€ten Qj Pn — €;:jx€1mj QmPe)- 
By changing the summation indices we can write this as 
h h 
7 (Geer On Pn — GiiSine Omi) = 7 (imi Ext Tem euiOmin: (2-12) 


We now make use of the following property of the €;,,,, which is easily verified : 
Gree) = Onn = Onno (213) 
Writing the second term in (2.12) in the form (2.13): 
Cink Exim = Oi: Onm — Oim Ont 
and adding it to (2.13) gives 
Cimk nt + Eink Exim = Oin Omt — Sim Ont = — ith bmn: 


The last equality again follows from (2.13). Substitution of this expression 
into (2.12) gives 


[L;, Li] = IE it, Ekmn OL: 
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If we insert the definition (2.10) of L; on the right-hand side, we obtain the 
commutation relation of the angular-momentum operators L;: 


(lees L,] = ihe, Ly. (2.14) 


We have obtained the commutation relations (2.14) of the angular- 
momentum operators (2.10) from the Heisenberg commutation relations 
(2.1). The expression of the energy operator (2.11) does not contain the P; 
and Q explicitly; this is true for all physical observables of the rotator. In 
fact the operators P; and Q; are unphysical observables for the quantum- 
mechanical rotator system in the following sense: It is not possible to prepare 
the quantum-mechanical rotator so that it is in a generalized eigenstate of 
momentum or position. Therefore, for the rotator, the L; that obey the 
commutation relations (2.14) are the fundamental physical observables, and 
the fact that they may be obtained from the momentum and position opera- 
tors can be ignored. In fact, the operators L; defined by (2.10) or the classical 
quantities |; defined by (2.9) are a particular case of an observable that is 
connected with new degrees of freedom of physical systems in the three- 
dimensional space. 

A physical object in the three-dimensional physical space has six degrees 
of freedom: three translational degrees of freedom described by the three 
coordinates x;, and three rotational degrees of freedom, described by a 
rotation R(a, B, y) that depends upon three angles «, f. + (e.g., the three 
Euler angles, or the three angles of rotation around three fixed coordinate 
axes). The momentum p, is the canonical variable conjugate to the coordinate 
x;, and the canonical variable conjugate to the angular coordinates ~; is the 
angular momentum f;. 

If we forget about the constituents of the diatomic molecule and consider 
the dumbbell as an entire physical system whose center of mass is fixed at a 
particular point in space, then the remaining degrees of freedom are the 
rotations by the three angles, and the dynamical variables are these angles 
and their conjugates, the angular momenta /;. In our particular case above 
the |; are obtained from the momenta and coordinates of the constituents, 
which are thought of as mass points, i.e., not possessing rotational degrees of 
freedom. 

In general, a classical extended particle, in the three-dimensional physical 
space, has as variable the momenta p; and the angular momenta s; (spin) 
associated with it, which are the conjugates of the linear coordinates x; 
and the angular coordinates a; respectively. For the case that the extended 
physical object is a dumbbell the spin s; is the orbital angular momentum 
I; = €;j,X;P, Of its two constituents. 

For the quantum physical extended particle, the momentum is represented 
by the operator P;, and the spin by the operator S;. It is now easy to con- 
jecture that the defining commutation relations of the spin operators S; are 


[S;, Sj] — ihe; Sy: (2.15) 
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For the special case of the spin of the dumbbell, these relations were derived 
in (2.14) from the relations (2.1), which had already proven to be successful. 
Equation (2.15) can also be derived from the properties of the group of 
rotations, if one assumes that the rotation R(a, B, y) of the physical object is 
represented (continuously) by a (unitary) operator U(q, f, y) in the space 
of physical states of this quantum-physical object. This latter assumption 
can be derived from the fact that the rotations are symmetry transformations 
and that the rotation group is a symmetry group of the physical system 
(Wigner Theorem). We will not discuss this connection here, but we take 
the algebra of observables defined by (2.15) as the starting point. 

We will now investigate the properties of the algebra of operators generated 
by the J;, which fulfill the commutation relations 


LJ: Ji] ae ihe J) (i, k= iis Zs 5); (2.16) 


where J, may be either an L; as given by (2.10) or an S;. In particular we shall 
find the properties of all the J;, which are linear Hermitian operators in a 
linear space. It will turn out that the set of all J; is richer than the set of the L; 
given by (2.10). The algebra generated by the J; is called [because of its 
connection to the rotation group denoted SO(3)] &(SO(3)) or &(SU(2)). 


I1I.3 The Algebra of Angular Momentum 


We shall now find all possible solutions of the commutation relation (2.16) 
that fulfill yt = J;. (We remark that to be mathematically rigorous, the 
condition J? = J; would have to be given a mathematically precise formula- 
tion, which we replace with the additional assumption that there exists at 
least one eigenvector of J, in the Hilbert space.) 

Instead of working with the J; (i = 1, 2, 3), we introduce the following 
linear combinations: 


Hee J, He he (Fy + iJ) H_=h'(V, —iJ2). GB.) 
The Hermiticity condition J! = J; is expressed by 
i she At, ee JEP eae (3.2) 
From (2.16) it then follows that 
[H3,H.]= +H:, 
[Hen 2H.. 


G3) 


The operator J? can be written 
J? = WH? (3.4) 
with 
H? = H,H_ + H2—H,=H_H, + H3 + Hs. (3.5) 
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H? and therefore J* have the property that 
it) — He = 0: (3.6a) 
or in general 
prea = 0) (3.6b) 


where 4 is any element of the algebra of operators 6(SU(2))* generated by 
J; Gi = 1, 2, 3), 1.e., A is any element of the form 


A=ata'), +'a4J,J,+ @*S,JjJ,+---, (3.7) 
where a, a’, a'/, a*,..., are complex numbers. Because of the property (3.6), 
H? and J? are called invariant operators (Casimir operators) of the algebra 


&(SU(2)). 

We now find all ladder representations of &(SU(2)), i.e., all solutions of the 
commutation relations (2.16) by linear operators in linear spaces that are 
obtained by applying to one eigenvector of H, the whole algebra &(SU(2)). 
(This one eigenvector is assumed to exist.) 

Let f,, be normalized and be an eigenvector of H; with eigenvalue m, i.e., 


(Gen Sn) =a IP Hy fin = Min (3.8’) 
(f,, are called weight vectors; m is called a weight.) Then we define 
IBS a is i 


and calculate using (3.3) 


Ay fms = 3H: fn = (H,H; ae Hae) hin 
(=(i,mt Ha)f, = (mt DNs Sn (3.8) 
me (m aE Dacre 


Thus f,,4 is, if it is different from 0, an eigenvector of H, with eigenvalue 
mE 

tm can be chosen so that it is not only an eigenvector of H, but also an 
eigenvector of H?, because H; and H? commute. 


PROOF. 
HH, = WA; 7, — ete 


Let us denote the eigenvalue of H? belonging to f,, by c: 
H fn = Sm: (3.9) 


Then the above relation is an identity. Note that if two operators do not commute then 
in general they have no common eigenvectors. a 


We note some properties of (3.9): 


1. The number c is nonnegative. 


* The symbol & stands for enveloping algebra. 
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PROOF. 


C= (fms Af) =O  fins IT + JE + I3) Sn) 
=h-*(Fy fall? + VJ2 full? + ls fall?) = 0. O 


2. Any vector f obtained from f,, by applying any element A of the form 
(3.7) on it: f = Af is again an eigenvector of H? and has the same 
eigenvalue c. 


PROOF. 
H?f = H’Af,, = AH7f,, = cAf,, = cf, where we have used (3.6). O 
3. Any eigenvalue m of H, fulfills 
ne = (3.10) 
PROOF. 


0 Ss HEAD i Sons J fea) tr (J2 te J, 1; 
= hfs I7fm) ~ Sms 13 Smad} = 7H 7e — m”)|| fiall’, 


from which follows (c — m*) > 0. 


Note that in the proofs the Hermiticity property of J; has been used. 

If we start with an arbitrary eigenvector f<¢,, of H, and H? and apply 
H., successively, then we obtain according to (3.8) new eigenvectors f;, of 
H, with ever-increasing eigenvalues. Because of (3.10), after a finite number 
of steps we must reach the eigenvector f{° with the largest eigenvalue | of H, 
i.e., with 


i+ = Hy, fi = 9. (3.11) 
From (3.5) it then follows that 
H? fi = (83 + As) fi = 1 + Ui. (3.12) 


Consequently the eigenvalue c of H? and the largest eigenvalue / of H3 
(highest weight) are connected by 


c = II + 1). 


Instead of characterizing the eigenvectors of H” and H, by c and m, we can 
characterize them by / and m. 

If we apply H_ successively to f;,, then after a finite number of steps, 
because of (3.10), we must reach the vector f/, with the lowest eigenvalue pu 
of Aa, tes with 


fi- = H_f, = 0. (3.13) 
From (3.5) it then follows 
H? fi, = (H3 — Hs) f, = wu — DSi (3.14) 
Comparing (3.14) with (3.12) we find 
Ki + 1) = pl — 0), 
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and the only solution of this equation for y that fulfills (3.10) is 
p= —L (3.15) 


Thus if we start with the vector f!, apply H_ successively to it and normal- 
ize, we obtain the sequence of vectors 


ties = (Ce Het, 
fina = (j—1)" "H- fila, (3.16) 


ai = (9 to. 


where the @,, are 


ty = (A fins A fn) 
We shall finally reach the vector with the smallest weight y: 
fi = Gus) Heft 
As p = —I, we have 2] + 1 vectors in the sequence (3.16). 
fi, withm=11-—1,1-—2,...,-—14+1, -l, (3.17) 
which fulfill 
CAG See (3.18) 


As 2] + 1 isa number of vectors, it must be an integer; consequently / can be 
only one of the following numbers: 


i= 0,4, 1,3,... (3.19) 


Thus for a given number /, which can be one of the numbers in (3.19), we 
have 2/ + 1 vectors f}, that are orthogonal to each other and span a space, 
which we shall call 2's» 


Gi = Pf = 5 arf. (3.20) 
We shall now determine the value of the normalization constant «,,: 
Oy On = (CH fons H- Son) = Seas Hs H— fo) 
= (f,,, (H? — H3 + H3)f,) = (+ 1) — m? + m. 
Thus, except for a phase factor (which remains undetermined), 
Om = /ME + 1) — m? + m= ./( + ml — m + 2D), (3.21) 
and we have 
H_ fi, = JU + m= m + 1) fina = Om Sma a (3.22) 


It remains to determine H, f,,. We know already that H, fi, < fia. 
We set H, fin = Bmfin+1 and calculate 


Gta ia) = (Hfes te = Cie ee) = Cee): 
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Figure 3.1 Example of a weight diagram of an irreducible 
representation of SU(2). 


Therefore 


Bn = ms. = /U + m — 1) — m) = B,,, 


and we have 


ee ae ee ee C2). 


Summarizing, we have found the following: For every integer or half- 
integer value | there is a space #' spanned by 2/ + 1 orthogonal vectors 
fi,(m = —l,...,1). In this space the operators H,, H_, H are given by 
(3.8’), (3.22), (3.23); and thus the action of any element A € &(SU(2)) given 
by (3.7) on any vector fe #' given by (3.20) is determined. To indicate that 
for every | one obtains a different operator, one could also write H®, H®, 
H® for the operators in (3.8’), (3.22), (3.23). The space A is called an irreduc- 
ible representation space of &(SU(2)). In this space the elements H,,H_, H3 
defined by (3.3) are represented by the operators given in (3.8’), (3.22), (3.23). 
These operators are called the (2! + 1)-dimensional irreducible representation 
ofthe H, ,H_,H,.Ascan be seen, they depend upon /; for each! = ie eee 
there is one “different” set of operators. All of the vectors in #! are eigen- 
vectors of H? with the same eigenvalue i(1 + 1). There is no element 
Aeé &(SU(2)) that can transform from a vector f'€ #' to a vector fle R" 
with | 4 I. This fact is expressed by the statement that A' is “left invariant 
by all A e¢ &(SU(2)).” In particular, it is left invariant by the J; (i = 1, 2, 3) 
and whe H.., H,. 

If we plot the possible values of m in an irreducible representation along a 
line, then we obtain the weight diagram of the representation characterised 
by |, of SU(2). This is the simplest example of a weight diagram for Lie groups. 
For! = 2 this is shown in Figure 3.1. To each - there corresponds a basis vector 
fi, in the representation space R', or equivalently there corresponds the one- 
dimensional subspace spanned by f/,,. Each such subspace (or basis vector) 
represents a (pure) physical state. Thus to each point in the weight diagram 
there corresponds a pure physical state. 

The smallest space is #°, which is one-dimensional, as shown in Figure 
3.2: 

In general, we have a weight diagram for each representation, and the 
action of the operators H , , H _ can be represented in this diagram, as shown 


———  ™ 
0 
1=0 
Figure 3.2 Weight diagram of the one-dimensional representation of 


SU(2). 
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= —2 = 0 +1 +2 +3 


Figure 3.3 Collections of weight diagrams of irreducible representations 
of SU(2). 


in Figure 3.3. For each weight diagram, there is a space #’, and for each space 
&' there is a possible state (or set of states) of the quantum-mechanical 
system characterized by the value /. Because of the correspondence between 
the operator L (2.10) and the classical angular momentum | (2.8), the number 
lis called the angular momentum quantum number: 


L2f! = AK + 1)f1. 


Thus the quantum-mechanical angular momentum can take only a discrete 
number of values. The physical state that corresponds to a given space #', 
and which is described by the statistical operator W = (dim #') 'A' = 
(21 + 1)~ 1A! where A! is the projector onto #', has a definite angular mo- 
mentum /. In general (except for |! = 0) this is not a pure state; it can be a 
mixture of states with different values of mim = —/1, —14+1,..., L. 

The space #' is the direct sum of one-dimensional spaces F,,, 


+1 
R= O4i, (3.24) 


m=-l 


and each &!, is spanned by the vector f',. 

It has already been mentioned that not all sets of linear operators J; 
(i = 1, 2, 3) that fulfill (2.16) can be given by (2.10). It can be proven (Problem 
III.1) that for L; given by (2.10) the number / can only take the values / = 
0, 1, 2,...;in other words, the operators L; given by (2.10) can be represented 
by operators in the spaces #' with! = 0, 1, 2,... only. Thus for the operators 
given by (2.9) there are a countable number of representatives L‘” in spaces 
R' that are left invariant by the L‘”. These spaces #! are not, however, left 
invariant by the Q, and P;. 

The operators L; in # with | = 5, 3, 3,... or any direct sum of them, 


Of 
l=half 
integer 
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cannot be expressed as functions of the operators Q; and P;. For | = 4 the 
operators L‘'='/) are called spin operators. The 2 x 2 matrices o; with the 
matrix elements 2(f/,'/?, L; fi, 1/2) are called Pauli matrices (Problem 
Ti 7). 


III.4 Rotation Spectra 


As mentioned before, the algebra &(SU(2)) does not contain operators that 
transform out of a given &'. The algebra of observables of the quantum- ~ 
mechanical rotator is, however, larger than &(SU(2)); additional elements can 
be formed, e.g., algebraic functions of J; and P; or of J; and Q;. The observ- 
ables Q;, for instance, have the property that they transform from a given 
R' into the neighboring #'*! and Z'"!: 


Q;:2'-2' ea", (4.1) 


but not into #'*" for n > 1. We shall prove this in the section on parity. 
The space of physical states @ of the quantum-mechanical rotator is the 
direct sum of the spaces #': 


A= SOR (4.2) 
1=0 


Before we justify this statement, we want to give a brief description of the 
properties of &. 

@ is not an irreducible representation space of &(SO(3)), the algebra of 
angular momenta. It is called a reducible representation space, and it reduces 
as given in (4.2) into a direct sum of irreducible representation spaces R 
(( =a0e1) 2, .. ether operators, H 2,4, now considered as operators 
in the big space &, transform every element of a given space & into an 
element that is again in the same @'. Thus the subspaces R' of B are left 
“invariant” by the H,, H_, Hy, and consequently by any A € &(SU(2)). 
The operator H?—which is a number in the space R', namely I(1 + 1)—has 
in Z a nontrivial spectrum, namely 


spectrum H? = {i(! + 1), 1 = 0, 1, 2, 3,...} (4.3) 


The weight diagram for the representation in # is shown in Figure 4.1. 

We shall now justify the statement (4.2). We assume that the J; are the 
angular momenta L; = €;,.Qj Px: then, according to the result stated in 
Section III.3 and proved in Problem III.1, only integer values of / are allowed, 
ie., @ contains only # with | = 0,1, 2,.... That # contains all the #' 
(I = 0, 1, 2, ...) follows from the fact that there are observables for the rotator 
(e.g., the operators Q;) that transform from a given R' to the neighboring 2 ' 
and &'*! according to (4.1). That each &%' appears only once follows from 
the fact that for the rotator no additional quantum number Is necessary; if 
one #'° were to appear twice or more, then there would be two or more 
vectors f'9(1), f/0(2), ... with the same quantum numbers |), m, and a new 


106 Energy Spectra of Some Molecules 


Figure 4.1 Weight diagram of E(3) or of SO(3, 1). 


quantum number would be necessary to distinguish between these two or more 
vectors. But the rotator is just that model for which there is no other diagonal 
observable besides angular momentum (L? and L;). (In other words, a 
real physical system can be a rotator only to the extent that no other quantum 
numbers are necessary for the description of its properties; e.g., polyatomic 
(symmetric top) molecules cannot in general be described by the rotator 
model, and even for the diatomic molecule, the rotator model is only an 
approximate description that neglects all but the rotational properties of 
the dumbbell.) Thus, as always, the justification of (4.2) is that in nature there 
are physical systems whose physical states are (up to a certain limitation) 
described by &. 


[| Mathematically this can be stated more briefly: “The spectrum- 
generating algebra of the rotator is &(E;).” &(E;) is generated by 
L;, Q; that fulfill the commutation relations 


[L;, Lj) = thea, Ly, [L;, Q;] = ihe;,.Q;, [Q;, Q;] = 9, 


and & is a particular irreducible representation space of 6(F3).. 


Each dot on the weight diagram of # represents the pure state that is 
described by the one-dimensional subspace #}, (/, m fixed) spanned by f;,. 
The (normalized) statistical operator for the pure state W = Aj,, where Al, 
is the projector on &',, represents a quantum-mechanical system for which 
the angular momentum has the definite value / and the 3-component of 
angular momentum, H;, has a definite value m. As there is no distinguished 
direction in space and the coordinate system has been chosen arbitrarily, 
H, represents the angular momentum around an arbitrarily choosable 
direction; it is also called the helicity. 

The values of the energy operator in &, i.e., the energy spectrum of the 
rotator is obtained from (2.11) as 


1 
spectrum H = E, = a h71(1 + 1). (4.4) 


Thus we see that the energy levels depend upon /, as represented in the 
diagram of Figure 4.2. If we compare this with Figure 1.7, we see that the 
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r 
(a) = 
ar. oe, 
——— ; 
) HTT 


Figure 4.2. Energy levels and infrared transitions of a rigid rotator: 
(a) The energy-level diagram, (b) the resulting spectrum (schematic). 
[From Herzberg (1966), with permission. ] 


rotator has indeed the energy spectrum required to interpret the infrared 
spectrum of diatomic molecules. 

In contrast to the oscillator, the energy eigenspaces (i.e., the spaces of 
vectors with the same energy eigenvalue) for the rotator are not one- 
dimensional, except for / = 0. Therefore the state of a rotator with a definite 
energy value E,, (lo = 0) is not a pure state. If energy measurements have 
been made with the result E,,, the statistical operator is given by 


W =A (unnormalized), (4.5) 


or 
W = (Tr A®)~ 1A" = (2g + 1) 1A’ (normalized), (4.5’) 


where A’ is the projector on the (2/) + 1)-dimensional space A, By an 
energy measurement alone it is not possible to prepare a pure state of the 
rotator. Only under certain additional conditions— if a direction in space is 
distinguished (e.g., by an external magnetic field) can one prepare a state 
with definite helicity, i.e., a pure state Aj... If only the energy of the rotator 
has been measured but not the helicity, then the states with different helicities 
are assumed to appear with equal weight, which is why one chooses 


W = Alo = Ae se (i as Ae ooo de Alo fiver Aje (4.6) 


for the (unnormalized) statistical operator. 
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In order to calculate the frequencies that can be emitted and absorbed by 
a rotator, we have to know the selection rules. In our classical picture of the 
rotator we can consider it as a rotating dipole moment D, with 


D = const Q, (4.7) 


where Q is the vector between the centers of positive and negative charge. 
Classically the radiation is then a consequence of the rotation of this electrical 
dipole moment. Quantum-mechanically, the intensity of the absorbed or 
emitted radiation is proportional to the absolute value squared of the matrix 
element of the operator D, i.e., proportional to 


Xfm l Ql fn? I? (4.8) 


Thus dipole radiation will only be obtained in transitions from states 
fin to fj, between which the matrix element (4.8) is nonzero (quadrupole 
and higher-order radiation is negligibly small). It will be shown in the 
section on vector operators that [as was stated in (4.1)] 


CfFLAQISiD = 0 unless] =f = +1. (4.9) 
That is, the selection rule for dipole radiation of the rotator 1s 
Al=1=]=T=-2u (4.10) 


If we compare this result with the experimental situation for CO, depicted 
in Figure 1.7, we observe complete agreement. Figure 1.7 shows the transitions 
between states not only with different values of angular momentum / but also 
with different values of the vibrational quantum number n. 

We also expect radiation from transitions between different rotator states 
of the diatomic molecule that belong to the same oscillator state n = 0. 
These transitions (absorption) are indicated by the f in Figure 4.2. The wave 
number of this radiation is given by Equation (1.1): 


Ep as, E, 


Vy = 
IYral ae 


With (4.4) and (4.10) we calculate 
hd+id+2)-(+ 1) 


Misia] = 1 Imhe 
== 2 2(1 + 1) = B27 + 1) (4.11) 
4ncl : 
where 
h 
igor (4.11’) 


Thus the spectrum of a simple rigid rotator consists of a series of equidistant 
lines, as schematically drawn at the bottom of Figure 4.2. 
We expect the frequency for pure rotational transitions to be much 
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Table 4.1 Absorption Spectrum of HCI in the Far Infrared [From Herzberg (1966), 
with permission. | 


J 
Vealc = 


ae = 20.79(1 + 1) 


y 

fet vey Avene —«-20.68(1 ++ 1) —- —0.0016(1 + 1) 
1 20.68 20.79 
a 41.36 41.57 
3 62.04 62.33 
4 83.03 5,4 82.72 83.06 
5 104.1 ae 103.40 103.75 
6 OC aaeeere 124.08 124.39 
7 1850) Saas 144.76 144.98 
8 165.51 Sane 165.44 165.50 
9 (ear 186.12 185.94 

10 1 a ors 206.80 206.30 

11 226.50 227.48 226.55 


smaller than the vibrational frequency, because the spacing between the 
rotational energy levels is much smaller than between the vibrational energy 
levels as seen in Figure 1.7 (note the scale factor of § there). 

The pure rotation spectrum lies in the far infrared. The absorption 
spectrum of HC] in the far infrared has been measured, and the experimental 
results are given in the second column of Table 4.1. From (4.11) we expect 
that the frequencies will have an equidistant spacing. Therefore in the third 
column of the table the differences between the neighboring levels are given. 
This difference must be—according to (4.11): 


Ay = 2Bycy- (4.12) 
From the Av,,, we obtain 
h 
See = (0.34: em -. 4.13 


The fourth column of the table gives the values calculated from (4.11) with 
the value (4.13). We observe fairly good agreement between the calculated 
and observed values if we compare column 2 with column 4. (The first 
observed value is for | + 1 = 4 because the frequencies for I’s lower than 3 
lie in the far infrared and outside the region investigated.) The last column 
of the table gives a fit to the expansion 


Vier. = 2b(1 + 1) — 4d(l + 1° (4.14) 


(b, d are constants). Comparing this last column with the observed values in 
the second column, we see that the agreement of (4.14) with the experimental 
data is far better than that of (4.11). The energy spectrum that corresponds to 
(4.14) is given by 


E, = [bl + 1) — d?(l + 1)? ]2ahe (4.15) 
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Figure 4.3. Energy levels of the nonrigid rotator. For comparison, the 
energy levels of the corresponding rigid rotator are indicated by broken 
lines (for J < 6, they cannot be drawn separately). [From Herzberg 
(1966), with permission. | 


(b, d are constants). The energy levels (4.15) have been drawn in Figure 4.3 
with an exaggerated value of d. 

The explanation for the better fit of (4.15) to the experimental values is 
that the diatomic molecule HCl is not exactly a rigid rotator. The bonds 
between atoms are not rigid, and the interatomic distance varies with the 
speed of rotation, giving rise to a centrifugal distortion. Equation (4.15) can 
be obtained if we return to the classical picture in which the molecule is 
considered as two hard spheres (atoms) joined, not as in Figure 2.1 by a 
rigid rod, but by a spring. If the molecule rotates about an axis perpendicular 
to this spring, then at equilibrium the centrifugal force ’/(4x*) equals the 
centripetal force k(x — x,), where & is the spring constant and x, the inter- 
atomic distance of the stationary molecule. Thus 


I? 
The energy of this system is [cf. (II.2.3) and (2.4) ] 
2 
a 1 “a= 2 
E= Dux? + 5k(x — x,)’. (4.17) 
Making use of the expansion 
= a(14 224... (4.18) 
and (4.16), one obtains for E 
ihe ee i 
E=—,P +~— (0) + 0(’)’). (4.19) 


Qux2 Pa 
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The first term is the energy of the rigid rotator, and the second term is the 
contribution due to the centrifugal forces. Going to the quantum system 
operator by replacing the number I’ with the operator L?, one obtains the 
energy operator 


ee! 2 1 
De 2u7kx? 
from which the spectrum (4.14) follows. The better fit of (4.14) to the experi- 
mental values confirms the above classical consideration. But we also observe 
that the empirical value ofd(d = 0.0004cm~ ' for HCl) is orders of magnitude — 
smaller than that of b, which is obtained from the above fit: 
h 


buci ta era 
827cluc 


H (L)2, (4.20) 


= 10.395 cm~!. (4.21) 


This shows that the rigid rotator is a remarkably good model of the rotating 
diatomic molecule. 

As we shall see below, the spacings between the levels of the rotating CO 
molecule are considerably smaller than for the HCI molecule. Therefore the 
pure rotation spectrum of CO lies at a considerably longer wavelength, 
where experimental investigation is very difficult. 

We now want to obtain some quantitative features of the classical picture 
for the diatomic molecule. From the value (4.13) we calculate the moment of 
inertia of HCI: 


taer= 209 x 10-*° pcm”, 


With 
Wan = x = 6.0 x 10°73 2 
and 
my = ce. 07 0s 4" g. 
A 
one calculates (2.7): 
ne aa = 1.63 x 10-24 g 


From (2.6) we may calculate the internuclear distance of the HC! molecules 
using the values Ijc) and Myc)! 


Xue) = 1.29% 10-7 cin 


Thus we have calculated from the infrared absorption spectrum that the 
size of the molecule is of the order of 107 ® cm. This order of magnitude agrees 
very well with the values of atomic and molecular radii obtained from other 
classical considerations. We want to stress, however, that x is the value for 
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the classical picture of the quantum-mechanical system and is not the expec- 
tation value of a quantum-mechanical observable. 


111.5 Combination of Quantum Physical Systems—The Vibrating 
Rotator 


We shall now combine the quantum-mechanical rotator model with the 
quantum-mechanical oscillator- model to form the quantum-mechanical 
vibrating rotator (or rotating oscillator) model. This will provide a descrip- 
tion of the experimental situation shown in Figure 1.5, 1.6, and 1.7. We first 
discuss the general case of a combination of two quantum physical systems. 
For this we require a new mathematical notion: the direct product or tensor 
product of linear spaces. 


[Let R, and R, be two linear spaces, let u; € R, and v; € R,, and let 
a;; € C (complex numbers). The set of all (arbitrarily large but finite) 
sums 


f= Yajur; (5.1) 
tJ 


where a;; takes any value in C, forms a linear space, which is called 
the direct-product space and is denoted R, ® R2. u;v; 1s the formal 
product, which is also written u;v; = u; @ v;. If (u, uw’), denotes the 
scalar product in R,, and if (v, v’), denotes the scalar product in 
R,, then the scalar product in R, ® R, is defined by 


(r aj; U;,0;, iy bt) = iy A: [Dim Uj. U))1(v;. v2 6 (5.2) 
ij lm ijlm 


(Remark: If R,; and R, are Hilbert spaces, then the “completion” 
of R, © R, with respect to this scalar product is the direct Hilbert- 
space product.) 

If d, is a basis in R, and wy, is a basis in R,, then 


Fru = Oy @ Wy = Oy (5.3) 


is a basis in R, @ R3. 
If A, is a linear operator in R, and A, 1s a linear operator in R,, 
then the operators 


in R, ® R, are defined in the following way: 
Cf = ey a; (A,u;) & Vj, (5.4’) 
ij 


Bf = » aj ju; ® (A20;). 
tj 
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The linear operator A = A, © A, in R, @ R;j is defined by 
Au; ® v,) = (A, ® A,)(u; © v4) = der (A,u)) @ (A20,). (5.5) 


It is easily seen that if A,, B, are linear operators in R, and A, 
B, are linear operators in R,, then 


A,B, @ A,B, = (A, @ A,)(B, ® B,). (5.6) 


Every operator A in the direct-product space is a linear combina- 
tion of direct products of operators, i.e., 


A= }) AY @ AY, (5.7) 
with A® linear operators in R, and A$) linear operators in R). | 


With the notion of the direct product of spaces we can formulate the basic 
assumption about the physical combination of two quantum-mechanical 
systems: 


IVa. Let one physical system be described by an algebra of operators, 
sf ,,in the space R,, and the other physical system by an algebra ./, in Ro. 
The direct-product space R, ® R, is then the space of physical states of 
the physical combination of these two systems, and its observables are 
operators in the direct-product space [given in the form (5.7) ]. The particular 
observables of the first system alone are given by A, @ I, and the observables 
of the second system alone by J © A, (J = identity operator). 


We reemphasize that IVa is a basic assumption of quantum mechanics and 
can only be justified by the fact that such physical systems exist. 

We shall now apply this basic assumption IVa to the diatomic molecule 
that vibrates and rotates. 

We called the space of physical states of the oscillator #. In # we intro- 
duced a basis of eigenvectors of the operator N or H,,.: 


basis of #: , = |n> i 0. 1255. .): (5.8) 


The action of all observables (all elements of the algebra of observables of 
the quantum-mechanical oscillator) of the harmonic oscillator on the basis 
vectors |n> is known from Chapter II. 

We called the space of physical states of the rotator #. In 2 we introduced 
a basis of eigenvectors of the operators L; and LL? Or Hey 


basis of @: f!, = |lm 
eas lo. m = integer with —! <m< +). (5.9) 


The space of physical states of the vibrating rotator is, according to IVa, 
the direct-product space 


S=H@&Z, (5.10) 
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and the observables are the operators );; A”. @ A®, where A®. is any 
observable of the oscillator and A‘, is any observable of the rotator. The 
basis system in S is obtained as the direct product of the basis systems ¢,, 
in # and f!, in &, and is denoted by |nlm): 


|nlm> = |n> @ |Im> = $, ® fm: (5.11) 


We have already mentioned that the rotating diatomic molecule is not a 
rigid rotator and the vibrating diatomic molecule is not a harmonic oscillator. 
Furthermore, the rotations and vibrations are not independent motions of 
the molecule: In our classical picture the diatomic molecule is a system of 
two mass points which are connected by a massless spring. Consequently 
there are interactions between the vibration and rotation caused, e.g., by 
the fact that during the vibration the internuclear distance x = (x;x;)'/? 
changes, and consequently the moment of inertia J = yx? changes. 

For the moment we want to neglect all these finer details and consider 
the idealized system that is simultaneously a rigid rotator and a harmonic 
oscillator—keeping in mind, however, that his is an idealized system, which 
can at best be only approximately correct. 

The energy operator of this idealized physical combination of the harmonic 
oscillator and rigid rotator is given by 


H=f,. @i+it oi... (12) 
where 
Hose = ha(N + $1) = > p24 ae Q? 
Qu 2 
and 
ee 


If the interactions between the two systems is neglected, all observables are 
given by 


A= Agse @ 1 + 1@ Arr. (5.13) 


From (5.12) we obtain the energy spectrum of the idealized vibrating 
rotator: 


1 
spectrum H = E,, = ha(n + 4) + a h71(1 + 1). (5.14) 


The experiments show that the system constants w for oscillator and J for 
the rotator fulfill hw > h?/(2!) (the pure vibrational transitions are in the 
near infrared and the pure rotation transitions are in the far infrared). The 
energy-level diagram that we obtain under these conditions from (5.14) is 
shown in Figure 5.1. To obtain the transition frequencies we use the selection 
rules 


An =i and Al= 41 (5.15) 
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Figure 5.1 Energy levels of the vibrating rotator. For each of the first 
five vibrational levels, a number of rotational levels are drawn (short 
horizontal lines.) [From Herzberg (1966), with permission. ] 


given by (1.10) and (4.10). If we consider a particular vibrational transition 
from nton + 1 (absorption) or n + 1 ton (emission), we obtain from (5.14) 


the frequencies (in cm™ *) 


pe je ee | 7 Ent 
x 2rhc 


, — Ent ii-t = ee 
J nhc 
= v, — 2Bl (ea 


for absorption, where B is given by (4.11’): 


neh __k 
 8r2cl 4ncl 
and 
y= 2 
0” ne’ 


vo + 2B+2BI (Al= +1), 


(5.16R) 


(5.16P) 


(4.11’) 


Thus we have two series of equidistant lines, which are called the R and P 
branches, with a gap at Vp (as Al = 0 is excluded by the selection rule). The 
corresponding transitions are indicated in the energy-level diagram, Figure 
5.2. The frequency spectrum calculated from (5.16) is depicted in strip (b). 
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Figure 5.2 Energy-level diagram explaining the fine structure of a 
rotation-vibration band. In general, the separation of the two vibrational 
levels is considerably larger compared to the spacing of the rotational 
levels than shown in the figure (indicated by the broken parts of the 
vertical lines representing the transitions). The schematic spectrograms 
(a) and (b) give the resulting spectrum with and without allowance for 
the interaction between rotation and vibration. In these spectrograms, 
unlike most of the others, short wavelengths are at the left. [From 
Herzberg (1966), with permission. | 


The observed spectrum from Figure 1.7 is depicted in strip (a). Thus the 
prediction of the vibrating-rotator model without interaction between 
vibration and rotation agrees rather well, but is not quite accurate. The 
observed lines in the R branch draw closer together, and those in the P 
branch draw farther apart, than the predicted equidistant lines. This is due 
to the interaction of rotation and vibration. 
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If one assumes that the moment of inertia is different in different vibrational 
states, then one obtains 
1 
pa i 


Eq = ho(n + 4) + — #1 + 1D) (5.17) 


instead of (5.14), and for the wave numbers of the resulting lines 


Ewy = Ew ! " rey neg ye 
= ee = volt’ — n") + ByP( + 1) — Bel" + 0), (5.18) 
where 
h 
i 
Prac 


From this one obtains the absorption frequencies for transitions nl” > n'T’ 


between neighboring vibrational levels: 


VR = Vo + 2B, + (3B,. fam Bl + (B, mae Bie 
d=1+1, 7 Al = +1), (5.19R) 


iy, — (B+ B+ (By — B, 0 
(=l=—1, 9 f=) “Al=—1. G19P) 


Equations (5.19) give excellent agreement with the empirical fine structure 
of the infrared bands. 

For the HCI molecule the values of B, have been obtained for the various 
bands n’ <n": 


0-1 0-2 3 0-4 Oi 5 
(transitions with An > 1 occur as a consequence of the small anharmonicity; 
cf. Figure 1.3). The results are summarized in Table 5.1. The difference AB, 


between successive values is very nearly a constant, so that B, can be fitted 
by the formula 


B, = B, — %(n + 4), (5.20) 


Table 5.1 Rotational Constants of HCl in the Different Vibrational Levels of the 
Electronic Ground State. [From Hertzberg (1966), p. 800, with permission. ] 


n B,(cm~') — AB, (cm~") 


0 10.4400 0.3034 
1 10.1366 

0.3037 
2 9.8329 

0.2986 
3 9.5343 

0.302 
4 91232 0.299 
») 8.933 : 
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where a, is a constant small compared to B, = 10.5909 cm~’, the equi- 
librium value of B,,. The value of B, given in the table for the rotation vibration 
spectrum agree within the accuracy of the measurements with the value 
BHC! — 10.395 cm! obtained from the pure rotation spectrum of HCl, 
Equation (4.21). 

Experimental values for the vibration-rotation spectra of the CO molecule 
are not as numerous or as accurate. From the spectrum depicted in Figure 
1.5 one obtains 


Be — 196 cme (5.21) 


This value, and hence the fine structure in the energy spectrum, is considerably 
smaller than that for HCl. 

The diatomic molecule with the largest rotational constant B,, and thus 
the largest energy difference between rotational levels, is the H, molecule, 
for which BE? = 60.80 cm7!. 

A qualitative theoretical explanation of (5.17) with (5.20) follows from the 
classical picture of the diatomic molecule as two rigid spheres connected by 
a spring. When this system is in a state of higher vibrational energy, it has a 
larger amplitude and consequently a larger moment of inertia. Consequently 
I, ' decreases with increasing n, as expressed by (5.20). 

For the quantum-mechanical observables, the empirical formula (5.20) 
means that in the presence of an interaction between vibrational and rota- 
tional degrees of freedom, the form (5.13) is not sufficient. For the energy 
operator one has in addition to (5.12) an interaction term, for which one may 
try as a first guess 


Aint = GA os cy) 1a hey (5.22) 


where g is a coupling constant of dimension (eV)~ '. Thus the energy operator 
for the vibrating, rotating, interacting diatomic molecule in this approxima- 
tion will be given by , 


H = H,,, @1 + 1 @ Ay + GHosc ® Hots (5.23) 
where 
Hose = ho(N + 4) (5.24) 
and 
i= ES lice (5.25) 
a 


7, is the moment of inertia that corresponds to the equilibrium separation 
par eecet yl ea 

The energy values of the diatomic molecule with vibration-rotation 
interaction are the expectation values of H in the physical states. It is, of 
course, not obvious that the |nIm> = ¢, ® f/, of (5.11), where ¢, are eigen- 
vectors of N and f,, are eigenvectors of L? and L;, represent the pure states 
of this physical system. However, as they happen to be also eigenstates of the 


Combination of Quantum Physical Systems—The Vibrating Rotator 119 


energy operator H of (5.23), they are the obvious choice for states in an energy 
measurement. Thus the energy values are the eigenvalues of H in the basis 
|nlm> given by (5.11) which are calculated to be: 


4 h? 
Em = h(n + 4) + oh h7(1 + 1) + gho or feel 1), (5.26) 
The wave number of the radiation quantum corresponding to the energy 


value, i.e., 
use En ae En 
,=—= 
" he 2nhe 


is called the term value* [cf. equation (1.12)]. The term value of the vibrating 
rotator are therefore, by (5.26), 


6.27) 


Val = Vo(n + 3) + (B. — a(n + 3) + 1), (5.28) 
where we have used the standard notation of molecular spectroscopy: 
h h 
Gel ~ Sep? ee 
ee a . (5.30) 
B, = B, — «{n + 3). O51) 


According to the above qualitative considerations, «, should be larger than 
zero, which is experimentally always fulfilled. Equation (5.28) with (5.31) 
gives for the wave numbers of the transitions in the R branch 


Vr = Va't+1 — Vail 
eG ar ee BeOS, — Bt + (By — Bp), (6.22) 


and for the wave numbers in the P branch 


Veit 1 Ya" 


Ge — 1) — (Ee 2B, + (8, — Be, (5.32P) 


i.e., the empirically well-established formulas (5.19). 

Two previously mentioned effects have not been taken into account in 
(5.23) with (5.24) and (5.25). These are the anharmonicity of the oscillator 
and the influence of centrifugal forces. Thus (5.28) is not the end of the story 
of the vibrating and rotating diatomic molecule. If these effects are also taken 
into account, then up to a certain degree of accuracy one obtains for the 
term values of a vibrating rotator 


v4 = on + 4) — oxen + 3)? + BI + 1) - D,d + 1)?, (5.33) 


4Note that we use the same symbol v for the frequency (in sec ') and the wave number (in 
cm~'); cf. statement following equation (1.12.) 
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where 
B, = B, — a{n + 4), (5.34) 
D, = D, + BAn + 4). (5.35) 
@, is the standard notation for 
7) tik 
Oe aes on ri (5.36) 
B, is given by (5.29): : 
h h 


= 5.29 

°  4ncl, = 4ncpx2 Ce) 
According to the semiclassical consideration leading to (4.20), D, may be 
expressed in terms of the reduced mass yp, equilibrium separation x, and 
spring constant k as 


h3 


= ———_—. 337 
4nckp*x® le 


r-J 


From (5.36), (5.29), and (5.37) it follows that the three system parameters 
D., B,, @, are not independent but are related by 


4B? 
D, = Pale (5.38) 


e 
The parameters x,, «,, B,, expressing the degree of anharmonicity, are known 
empirically to be small: 


which must be the case, as they represent the effect of corrections to models 
which are very well realized by microphysical systems in nature. The system 
parameters w,, x,., B., D., «., and B, have been experimentally determined 
for many diatomic molecules and are collected in tables (cf. Herzberg 
(1966]). Equation (5.33) gives a very good description of the vibration- 
rotation spectra of diatomic molecules, and only in exceptional cases are 
higher correction terms needed. 

Diatomic molecules are vibrating rotators only as long as the internal 
energy is sufficiently low—roughly, in the region of energy of infrared 
radiation. For higher energies (1-20 eV) which correspond to the visible 
and ultraviolet regions, the molecules are no longer just vibrating rotators, 
since new degrees of freedom become accessible to electronic transitions. 
In each electronic state the molecule is, however, still a vibrator, in the same 
way as in each vibrational state the molecule is a rotator. This leads to energy 
spectra as depicted schematically in Figure 5.3 for two electronic states. We 
shall not describe the electronic structure of molecules here, they are of the 


Ls 
i= 
7 
—S>= 2 
10 5 
—— 1 
———— 
——=r 0 
r ——— a ” 


——eEeee ee 
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oe ___ () 1 
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Figure 5.3 Vibrational and rotational levels of two electronic states A 


and B of a molecule (schematic). Only the first few rotational and 


vibrational levels are drawn in each case. [From Herzberg (1966), with 


permission. ] 
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same nature as the electronic structure of atoms which will be discussed in 
later chapters of the book. 


Problems 


1. Let P; and Q; (i,j = 1,2, 3) satisfy the canonical communication relation 
[P;, 0] = (h/i)6,;1. Define the orbital angular momentum L; = ¢;,.Q;P, (L = Q  P). 
The component operators are Hermitian [Equation (2.11a)] and satisfy the commuta- 
tion relation (2.14). Define H,, H,, and H _ as follows: 


(a) Show that 


and that 


Show that 


H,=h-'L, and Hy, =h'\(L, + iL»). 


[H3, Hz] = +H 


(H,,H-] = 2H3. 


(bes = LL; = h?(H, H_ AF fae a} H;3) 
= h>(H_H, + H} + Hs). 
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Collector Current, arbitrary units 


(b) 


(c) 


Show that H? = h~?L? is an invariant operator of &(SU(2)); i.e., show that 
(i?) Ao: Rees | = 0 

and 

fe 
and-consequently that 

[H’, A] = 0, 
where A is any element of the algebra generated by the L;, 
A=al+a'L,+ a¥L,L,;+--: 


(a, a’, a, ...€C). 
Prove that the spectrum of L? is #71(1 + 1) (1 = 0, 1, 2, 3, ...). Hint: Express the 
operator L, in terms of the annihilation operators a; and creation operators ae 


1 
a; = ——(Q; + iP)), 
2h 


Jah 


of the three-dimensional harmonic oscillator, and show that as a consequence 
of the spectrum of the harmonic oscillator L, can have only integral eigenvalues. 


2. Calculate the internuclear distance for the CO molecule using the absorption 
spectrum given in Figure 1.7. 


3. The energy-loss spectrum of vibrating H, molecules (Figure PS.1) shows 
two bumps at 0 and at 0.52 eV with respective intensities 3.5 and 7.8 x 35 = 0.26 (arbi- 


0 1 2 
Sweep Voltage, volt 


Figure PS.1_ Energy-loss spectrum of H, [From G. J. Schulz, Phys. Rev. 135, A988 
(1964) with permission. | 
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trary units), respectively. What is the statistical operator W for the ensemble of H; 
molecules in this energy-loss experiment? Can dipole transitions occur in this H, gas? 
What would be the frequency for these dipole transitions? 


4. Inan infrared absorption experiment with HCI molecules in the ground state it 
is observed that the following frequencies v (cm~ ') are absorbed: 


20.68 82.72 144.76 206.80 
41.36 103.40 165.44 227.48 
62.04 124.08 186.12 


Assume that in this absorption experiment only transitions between neighboring energy 
levels take place (dipole transitions). What are the energy levels of this (rotating) HCl 
molecule if the zero of the energy scale is fixed by Ey = 0? 


5. Let {fi=,'2, £45! 7}, be the orthonormal basis for the space 2”! ? defined in 
Section III.3. 
(a) Show that the matrices o, with matrix elements 6; = 2(f%, 1", Hef -yeare 


given by 
_ a pee 
Polit cle cslmeeo|  ~ (0 =1 


with respect to the basis { f/7',, fi7a''7}. The o; are the Pauli spin matrices. 


(b) Show that the Pauli spin matrices together with the unit matrix are a basis for 
the vector space of operators on 2". 


6. Let &; and #, be two finite-dimensional linear scalar-product spaces and 
R = R#, ® A, be their direct sum. Let A, bea linear operator in #, and A, bea linear 
operator in #,. The direct sum A, @ A, is defined as the map 


(A; ® A,)¥ = (A, ® A,)(P ® Y>) a (A,¥, ® A, >) 


for ¥,e FR; and P= 'P,@V,€R. Let H =, ® #, be the direct product of #, 
and #,. The direct product A, @ A; is defined as the map 


(A; ® A,)h = (A; ® A,)(hy 9 h2) = A,h, ® A,h, 


for hE &; and h = h, ® hoe #. 
(a) Show that in an appropriate basis the matrix a of the operator A, ® A> has the 


form 
0 ay) 
where a, is the matrix of A, and az is the matrix of A>. 
(b) Show that the matrix of the operator A, ® A, when written in an appropriate 
basis has the property 


— {yal 
A jx, it = Fi Akt > 
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i.e., may be written as a block matrix 


al) a) Ae ag’) 

a’) a’) we asa? 

aa’) a alq(?) 
or 

aa oe aa) 


where a is the matrix of A; (i = 1, 2) and a‘ are the matrix elements. 
(c) Show that 

(i) Tr(A, ® A.) = Tr A, + Tr A3, 

(ii) Tr(A, @ A) = Tr A, Tr Ay. 


CHAPTER IV 


Complete Systems of Commuting 
Observables 


In this chapter it is explained that the question of what constitutes a complete 
system of commuting observables is not a mathematical question but can 
only be answered by experiment. 


For the algebra of the quantum-mechanical harmonic oscillator, the eigen- 
vectors of the operator N [or of H = hw(N + 3D], 


N|n> = n\|n> (n= Oe) Cle) 


constituted a complete orthonormal system of # [see (11.3.25)]. For the 
algebra of the quantum-mechanical rotator a complete orthonormal system 
in Z was given by the eigenvectors of J and Js" 
J \jis> = JG + DR Liis> J3\ iis> = jshl isd. (1.2) 

(We shall use either j,; or m to denote the quantum number labeling eigen- 
values of J;, depending on typographical convenience for the topic being 
discussed. In this chapter we use j3.) Instead of the eigenvector [n> one can 
use the generalized eigenvectors |x) or |p> as a generalized basis for #, as 
expressed by Equations (II.8.16’) and (11.8.34). (Generalized basis vector of #? 
also exist for the algebra of the quantum-mechanical rotator.) 

The distinction between (1.1) and (1.2) is that for the one-dimensional 
oscillator (1.1), one operator is sufficient to define the basis. If for any vector 
od € # it follows that 


No = ag, (1%) 
Ss 
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then a is one of the nonnegative integers, say a = n', and @ = a|n’> where 
ae C. For the rotator model, two operators are needed in general to define 
the basis, and two are sufficient. If for any vector w€ & it follows that 


Ivy =ap and J, = by, (1.4) 


then (1) a = j'(j’ + 1)h? and b = jh, where j’ is an integer or half integer, 
and j3 1s one of the numbers —)’, —j’ + 1,..., j’;and (2) W = a|j'j5>, where 
aeC. Instead of J? and J; one could of course use any two independent 
(algebraic) functions f,(J’, J3) and f,(J’, J;) of J? and J;, i.e., functions f, 
such that j and j, are uniquely determined by the numbers /f,( j, j;) and vice 
versa. One would then obtain the same basis system. 

Instead of the basis system (1.2) one could also use another basis system, 


Coe 
J? | jin} = IG + Dh? | ifr} J2\ iz} = jot lio} (i) 
or any two other functions of the operators J; (i = 1, 2, 3) that commute 
with each other. To obtain a (generalized) basis system of # one need not 
restrict oneself to functions of J;, but could even use two functions of the 
J;, Q; (i = 1, 2,3) that commute with each other (e.g. Q; or P;).* The eigen- 
values of these functions usually specify a basis system of # completely. 


[Sometimes, but only in the case that at least one of these functions 
has a continuous spectrum, it may happen that in addition to the 
two (generalized) eigenvalues of these functions a further label is 
needed to specify the generalized basis system completely. | 


The system of commuting Hermitian operators that specifies the (generalized) 
basis system completely is called (following Dirac) a complete system of 
commuting observables (c.s.c.o.). The (generalized) eigenvalues of a cs.c.0. 
are called quantum numbers. 

For the quantum-mechanical one-dimensional harmonic oscillator, the 
c.s.c.0. consists of the one operator N (or H), but the operator Q may also 
serve as the c.s.c.o. For the rotator the c.s.c.o. consists of two operators; J? 
and J; are a convenient choice. 

From what was said above, and the fact that in the direct-product space 
one basis system is obtained as the direct product of the basis vectors in the 
two factor spaces, it is clear that one c.s.c.o. of the combination of two 
systems is given by the combination of the two c.s.c.o.’s for each subsystem. 

For example, a c.s.c.o. for the vibrating rotator is given by 


Neda e dae (1.6) 


For different algebras there are different c.s.c.o.’s. The larger the algebra 
(i.e., the more complicated the physical system), the larger is the number of 
Operators in a C.S.C.0. 


* Such a basis system is given in (3.76) of the Appendix to Section V.3. 
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[The problem of determining for which operator *-algebras there 
exists ac.s.c.o. is unsolved ; the requirement of the existence of ac.s.c.o. 
is certainly a restrictive condition. For certain classes of operator 
*_algebras .o there does exist a c.s.c.o. (e.g., if is the enveloping 
algebra of a nilpotent or semisimple group). This question is con- 
nected with the fulfillment of the conditions for the nuclear spectral 
theorem, of which we shall give a vague statement below. ] 


The physicist’s problem is not to find a c.s.c.o. for a given algebra,’ but is 
usually the reverse: From the experimental data he finds out how many 
quantum numbers are required, and what are the possible values of these 
quantum numbers. This gives him, according to the basic assumption Illa, 
the complete commuting system {A,} and its spectrum. He then conjectures 
the total algebra o/ by adjoining to {A,} a minimum of other operators such 
that the matrix elements of elements of . calculated from the properties of 
this algebra agree with the experimental values of the corresponding 
observables. 

Thus, the question of what is a c.s.c.o. for a particular physical system 
and the question of when a system of commuting operators is complete are 
physical questions. If an experiment gives more values than can be supplied 
by a given system of commuting operators, then this system is not complete; 
one has to introduce a new quantum number, i.e., enlarge the system of 
commuting operators. This usually requires a further enlargement of the 
algebra. 

We already treated one example of this procedure when we described the 
transition from the oscillator model of the diatomic molecule to the vibrating- 
rotator model. As long as one ignored the fine structure of the order of 
10-3 eV, the quantum number n was sufficient to describe the infrared 
spectrum. However, to explain the results of higher-resolution experiments, 
a new quantum number j—and therewith a whole new algebra, the rotator 
algebra—had to be introduced and combined with the oscillator algebra to 
give the algebra of the vibrating rotator. 

Because of the above described procedure for obtaining the algebra of 
observables .o/ for a given physical system, we can assume that for the algebras 
in quantum mechanics one always has a c.S.C.0.,2 Le., there exists a set of 
commuting operators 


A Ay hed 
such that 
Ault teks n> (1.7) 


! Although it would be nice to know the answer, as this restricts the class of admissible 


algebras. 
2 Mathematically this assumption can be stated: the conditions of the nuclear spectral 


theorem are fulfilled. 
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where the set of numbers A = {A} = {(A,,42,..., Ay)}, called the spectrum 
of A,, is real and in general continuous, and the set of generalized eigen- 
vectors |A, --- Ay>? is a generalized basis system. That is, every physical 
state vector ¢@ can be represented as 


b= | dua) [ay «+ ay Ay = Auld, (1.8) 
A 


where |, du(A) means summation and integration with certain weight 
factors (measures) over all values A in A. 

The statement (1.7) or (1.8) is the basis of the Dirac formulation of quantum 
mechanics. Recent results in mathematics assure us that the class of algebras 
that fulfill this statement is not empty. This was not the case when Dirac 
made his famous conjecture; none of the mathematical structures known 
at that time fulfilled the conditions justifying this statement. 

The statement (1.7) or (1.8) is the mathematical basis for the solution of 
actual problems; it does not tell us what the spectrum A and what the 
measure (A) (weight factor) is for the cases of physical interest. This depends 
upon the particular problem, and has to be solved for every particular case. 

An immediate consequence of the statement (1.7) or (1.8) is the following 
statement: Let A,,..., Ay be a cs.c.o. with (generalized) eigenvectors 
[Ay «+: Ay> = [A> such that 


Ada = Ane (eh 2 aN): 
Then from 
Ala ala) Ge NS (1.9) 
it follows that 
a,—A, and lata (1.10) 


where « is a phase factor. 

For the case that the spectrum A is continuous, i.e. that one of the A; can 
take continuous values, the formulation of (1.10) has to be given a precise 
meaning. In the following we shall generally use this statement only for the 
case of a discrete spectrum. 


> [In general, |A---Ay> € ®* and the physical state vector ¢ € ®.] 


CHAPTER V 


Addition of Angular Momenta— 
The Wigner-Eckart Theorem 


In Section V.1 the elementary rotator is defined as the system described by 
an irreducible representation of the algebra of angular momentum. Section 
V.2 discusses then the direct product of two irreducible representations of 
angular momentum and its reduction with respect to the total angular 
momentum. The Clebsch-Gordan coefficients are introduced, their recur- 
sion relations are derived, and their most frequently used values are tabulated. 
In Section V.3 tensor operators are introduced and the Wigner—Eckart 
theorem for the rotation group is stated without derivation. In Section V.4 
a new observable, parity, is introduced. Parity is then applied to discuss the 
spectrum of diatomic symmetric-top molecules. In an Appendix to Section 
V.3 the irreducible representations of the algebras of SO(3.1), SO(4) and 
E(3) are derived. 


V.1 Introduction—The Elementary Rotator 


In Chapter III we have treated the quantum-mechanical rotator. Its space 
of physical states is given by (III.-4.2): 


A= 3 OR, (1.1) 


1=0 
where #! is the irreducible-representation space of the algebra of angular 
momentum &(SO(3),,), and 
Ci Lj] = ihe; 1, Ly. C12) 
&(SO(3),,) is a subalgebra of the algebra of observables for the rotator. 
WS 
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Let us now consider the physical system whose space of physical states 
is #', with an arbitrary fixed value / = 0, 4, 1, 3,..., and whose algebra of 
observables is the angular momentum algebra. Every rotator can, under 
certain conditions, be considered as such a physical system —namely, if the 
energy E°*' that can be transferred to or from this rotator is small compared 
to the energy differences E, — E,_, and E,,, — E,. In nature, this condition 
is rarely fulfilled, because a state of higher energy always tends to decay into 
a state of lower energy “spontaneously,” and therefore the excited state has 
only a limited lifetime, unless E, is the ground-state energy value. It is, 
nevertheless, customary to consider these unstable states often as independent 
physical systems, not only in nonrelativistic quantum mechanics, but also 
in the relativistic case of elementary-particle physics. We call a physical 
system whose space of states is #' an elementary rotator. Elementary rotators 
are not just substructures of the diatomic molecule; they appear in other 
quantum-mechanical systems in all areas of physics. 


V.2 Combination of Elementary Rotators 


According to the basic assumption IV, the space of physical states of the 
combination of two elementary rotators with spaces #1, and #3, is the 
direct-product space #1, @ #3, = B. Let J{’) and J be the angular- 
momentum operators in #/;, and # 3) respectively, and let 

[Fx > (my = —Jy,---s +41) 
and 

|jzm2> (mz = —j2,..-, +J2) 


be the basis systems in Alt, and #3) respectively. Then a basis system in 
Ri, ® Ai, is given by 


limp ® |j2m2> = |j,myj2m2>. (2.1) 
The J (a = 1, 2) fulfill the commutation relations 
(I, IP] = hey JP. (22) 
We define operators in # = Zi1, @ Ai3, by 
J,= IM @1+41@ J. (2.3) 


As a consequence of (2.2) and (2.3) it follows that the J; fulfill the commutation 
relation 


(J Jj] = ihe; 4.5, (2.4) 


Thus the J; of (2.3) fulfill the defining relations of the generators of the 
algebra of angular momentum, and are thus a representation of angular 
momentum in the space #1, ® A/3,. The J; are called the total-angular- 
momentum operators of the combined system. 

The basis system (2.1) of # consists of eigenvectors of the following 
complete set of commuting operators (c.s.c.0.): 


jo2 ® ie ID ® I, ] 3) ee I ® Ai (2.5) 
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with the eigenvalues 


J? @ I jmyjomz> = j,G, + Yh? Limi j.m2> 
JS? @ I j,myj2m2> = my h\jym,j2mM2> 
I @ I? |jym,j.mz> = joi2 + Wh* Liimij2m2> 
1 @ JP |j,myjzmz> = myh|j,m,j,m2)>. (2.6) 


We now introduce the abbreviated notation for the operators in #: 
JY) = JM @ I, ee] Oh?) (27) 


These J are operators in the space 2 = #/\, ® A}, in contrast to the 
original J‘, which were operators in @/z) only. The J{” of (2.7) clearly fulfill 
the commutation relation (2.2), so we denote them by the same symbol. 
However, we should keep in mind that the old J are the restrictions of the 
new J to the space Bis). 

The basis (2.1) is in general not a physical basis for the combined system. 
A basis is physical if it consists of eigenstates in which the physical system 
can be prepared. If all physical states are eigenstates of the energy operator 
H of the physical system, then the physical basis must consist of eigenvectors 
of operators that commute with the energy operator. In general not all the 
J® commute with the energy operator (in particular, not if there is an in- 
teraction term between the two angular momenta). 

The operators that in general commute with the energy operator are the 
total angular momenta J; (they commute with H whenever H is rotationally 
invariant). Therefore it is convenient to choose a basis in # that consists 
of eigenvectors of the system of commuting operators 


JO2=JO72@l, JM? =1@I?%?, P=> J? Js. (2.8) 


We denote this basis by 
livjoj m>. (2.9) 
It has the properties 
J Vj, jojm> = ja + Ih? lfrji2im)>, 

Jslisioi m> = mhljyjr2jm), (2.10) 

VP litiafm> =IU + Dh’ livid m>. 
Each vector, and consequently each basis vector of the basis (2.1), can be 
expanded according to (1.2.15) with respect to the basis system (2.9): 


lismij2m2> = DS livind M> Gidod mj, myj2M2>; (2.01) 
j,m 
and each basis vector of the basis (2.9) can be expanded with respect to the 
basis system (2.1), 
livji2ijm> = » lismyj2mo> Gimis2MalirJoJ m). (2.172) 


my,,m2 
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The transition coefficients <j,;m,j,m2|jyj2jm)> are called the Clebsch- 
Gordan or Wigner coefficients, and are denoted by 


Gi My j2My|jsj2jm> = Gym, j2m|jm> 
= C(j;j2j mmm) 
= <m,m,|j m)> (2.13) 
= Gear 
for fixed values of j, and j,. In (2.13) we have given various notations used in 
the literature. 7 
Taking the scalar product of (2.11) with |j,m‘,j.m‘,> we obtain the orthogo- 
nality relation of the Clebsch-Gordan coefficients: 


Orsini Ome = un iim j2m>|jm>Gm\|j,m,j2m2>, (2.14) 
J,m 


and taking the scalar product of (2.12) with |j, j2j’ m’>, we obtain 
957 0mm! = pS Gjm'|j,m,j2M2> Gam, j2m2\j m>. (2.15) 


my, m2 
We now want to find the spectrum of J* and J; in 2 = Bi, @ B,, 
i.e., the values that j and m in basis system (2.9) can take. As j, and j, are 
fixed values, we denote 


liviatm> =|jm> and |j,m,> @|j2m.> = |m,> ® |m.> = |m,m,} 


For the sake of clarity we denote in the following calculation the direct 
product basis by |j,m,j.m2} = |j,;m,> ®@|j.m.> and the total angular 
momentum basis by }j,j.jm). 

(A) The vectors|j,j.jm>and|j,m,j,m } are eigenvectors of the operator 
J, = J} + JY with eigenvalue m and m, + m, respectively. The highest 
possible value of m is j, and the highest possible value of m, + m, is j, + jo. 
The highest possible eigenvalue of J; is therefore on the one hand j and on 
the other j, + j,. Consequently j = j, + j, is the highest possible value for 
m =m, + my). Let us therefore consider |j,j.j j> and 


VdsJoj2} = litd1> ® li2j2>. 
(B) The vectors |j,;m,j,m,} are eigenvectors not only of the c.s.c.o. (2.5) 


but also of the three operators J“)*, J)”, J, of the c.s.c.o. (2.8). The particular 
vector |j,j,j2j2} 1s however also an eigenvector of J?. 


PROOF. 
J? = jey2 Se QV. J af Ji?) 2 
a JO? 4 2 4 DIM 4 (FQ) 4 FYI?) (2.16) 
because 
FDI2) 4 JWI = AQpMy@ 4 JQ, 
Thus 


FP lisis> @ lied = HG. + D + joGi2 + 1) + 2jaj2)} Mi i1> @ lizta>, (2.17) 
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because 
4 IJ@IM + JOIP) j2j2> @ \j2jo> =, 


since m, = j, and m, =j, are the highest possible values of m, and m,, and thus 
Jj, 5,> = Oand J? |j, j.> = 0. But 


HG. + D+ join + 1) + Ye = Gi + H)G1 +h + YD 
ee): (2.18) 


since, according to(A),j = j; + j2. Consequently 1), j,> © |j2j2> is an eigenvector of the 
complete commuting set (2.8) with eigenvalues 


iG: + YD joliz + YD m=j=j,+J2 
and 
G+ D=G1 +5)G1 + j2 + 1). 


Thus | j;j;> ®@ |j2j2> and |j,j2j/> are both eigenvectors of the complete commuting 
set (2.8) with the same eigenvalues, and consequently according to (IV.1.10) 


lidi> ® Vizio = HID,» Hale. 


If they are both normalized, they differ only by a phase factor, which we choose to be 
unity; thus 


litjiJodo} = lis di> @ li2d2> = lirdai> G =s1 + ja) (2.19) 
or 
(Arjijojalim> = 1 (2.20) 


This vector (2.19) spans the one-dimensional space #;,,;, of eigenvectors of J; = 
J® + J? with highest eigenvalue m = j = j, + jo. 
(C) Let us now consider the space #;, , ;,- of eigenvectors of J, with eigenvalues 


This space is spanned by 
lids — 1 @lioi2> and lid) @ lizd2 — YD, (2.22) 


because both vectors (2.22) are eigenvectors of J; with eigenvalue j, + J. — 1. Thus 
Bie, 1 18 two-dimensional. One basis vector of the type (2.9) in #;, 4 ;,-1 18 


lidoi =r + i2m=j— 1), (2.23) 
which is obtained from (2.19) by applying 
tae Je + J), (2.24) 
The second one, which is orthogonal to it, must therefore be 
liz HA th -im=aAth—-—V, (2.25) 


because only such a value of j will have an m = j, + jz — 1. This vector is specified 
by the condition that it must be an eigenvector of the c.s.c.o. (2.8) with the indicated 
eigenvalues and is therefore [according to (IV.1.10)] determined only up toa phase factor. 
We fix this phase factor by requiring 


{jided2 — WW =A tio - 1 m = j) real positive. (2.26) 
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We proceed and consider next the space #;,.;,-2 of eigenvectors of J, with eigen- 
value 


m =f, + j2 — 2. 
This space is spanned by 
li —2>@li2> = i ~ 1 @j2-1) and |f1> @lj2 — 2 (2.27) 


(where we are now using the abbreviation |m,> ® |m,)> for |j, m> ® |j.m >) and is 
thus three-dimensional. One basis vector of the type (2.9) in #;,;,-2 is obtained by 
applying J_ to (2.23) and is 


VG =Ji +j2m =j — 2). (2.28) 
The second vector of the type (2.9) is obtained by applying J _ to (2.25) and is 
=h1 tie —Lm=j, + jo — 2). (2.29) 
The third one is orthogonal to these two and must therefore be 
l§G=hy +jp —2m =f, + jo —2), (2.30) 


because only such a value of j will have m = j, + j, — 2. To fix the arbitrary phase we 
require 


Uiididede — 2\f =j, + Jj. + 2m = j> real positive. (2.31) 
(D) In this way we proceed to the spaces 
Bay, Base aaa (2.32) 


with dimensions 4, 5,... and with new values of m =j, + j. —3,j,; +j.—4..., 
respectively. 

Without loss of generality we assumej, <j, (ifj, > j, we just renamej, > j,,j, > j,). 
Then the highest dimension of the eigenspaces of J, must be 2j, + 1. This is the di- 


mension of the space @,,_ ;, spanned by 


|—J1> @ li2?,1—di + D @ liz — 1),.--s i> @ lhe — rd. (2.33) 


This introduces the ultimate new value for j:j = j, — j,. The arbitrary phase of the 
vector 


Ee pe P, (2.34) 
is again chosen so as to make 
{AiJiJ2J2 — il) = j2 — jm = j> real positive (2.35) 
Equations (2.20), (2.26), and (2.31)-(2.35) are summarized in the phase convention 
{iij2m>{jj> real positive. (2.36) 


This convention is the standard phase convention for the Clebsch-Gordan coefticients. 
Thus the spectrum of j is 


Jh dasa + )s— ai io pay F = is 
or if we also admit j, < j,, 


J=h +d Jo — Liye ee (2.37) 
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(E) The space spanned by the 2j + 1 vectors 
pr (aj, 7 + 1, ---, +7) 
we denote by #/. This is the space that is left invariant by the operators J; = Jj”) + J 12). 
Then (2.37) can also be expressed as 
Rit, @ Ris, = B+2 @ QR]... @ BUI", (2.38) 


This is called the reduction of # into a sum of irreducible angular-momentum spaces. 

Summarizing, we have seen that the space Ri, ® Ai, is in general not an irreducible- 
representation space or ladder-representation space of the algebra of total angular 
momentum &(SO(3),,). That is, not all vectors of Ri, © Ai, can be obtained by 
applying the J. a sufficient number of times to one of its vectors, but rather it is the 
direct sum of several such irreducible representation spaces #/ as given in (2.38). We have 
seen in particular that the Clebsch-Gordan coefficients, which are the transition co- 
efficients between the two basis systems (2.1) and (2.9), are zero unless j is as in (2.37) and 
m=m, + m,: 


{m,m,|jm> = 0 
form#m, +m, j#i, tjads tjo—1--- lie Fal: (2.39) 


(F)! The Clebsch-Gordan coefficients are calculated recursively. The recursion 
relations are obtained if one takes the matrix element of J, = J‘) + J between the 
states |j m> and |m,m,>: 


fm, m2|F4\im> = SiG + 1) — mm + Dh{m,m,|jm + 
= {m,m|JP ar JP | jm) 
= (JP + JP)|m, m3}, Lim) 
= SiG, + 1) — m(m, F h(n, F Lm lim) 
+ Jig + 1) — mlm, F Dh{m,m, F 1|jm)>. 2.404) 
Here we have used (III.3.23), 
Li fi, = JUL + 2) — mm + Di nes (111.3.23) 


for 


and 
gor = Jo. 


In the first step we shall describe the calculation of the Clebsch -Gordan coefficients 
{m,m2|jj>; in the second step we then turn to the calculation of Clebsch-Gordan 
coefficients for all values of m, {m,m.|j m). 

Setting m = j in (2.40+), we obtain 


—JjxGy + 1) — mm, + 1m, — bmg |41> 
= /jxjz + 1) — mm, — 1){mym, — Ij. 241) 


1 In the following a brief outline of the calculation of the Clebsch-Gordan coefficients is 
given, only the results given in the table will be needed later and this part may, therefore, be 
omitted at a first reading. 
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Because of (2.39) we can restrict our attention to the valuesm, + (m, — 1) = (m, — 1) + 

m, = j(=m). Then (2.41) can be written 

Gal ge) ee 
Gi + m)Gi — m, + 1) 


1/2 
{mj — m|jj> 
(2.42) 


fm ~1j—m. + t= -[ 


Starting from m, = j, one can calculate all {m,m,|jj> successively from (2.42). One 
obtains for an arbitrary m, and m, = j — m, + 1 
{mm |jjy =(-1)-™ : 
: Cra iit 0nd Ia 2) Oar se 
2ji(2j1 — 1)-++ (2, — i, + my + 1) 


¥ Cee ee |) 


eg =a EE ess NED, 
He Pies) AO eet ine OL ae ee ee 

Ss fe 

QiyGr— my'Gs 7 Fm) Gs +) — 3)! 
(2.43) 

From the orthogonality relation (2.15), it follows that for j = j’ = m = m’ 
thn 
1= > {m,m,\jj>/?. (2.15’) 
mi=—ji 


Inserting (2.43) into (2.15’) and making use of the equality? 


sy fe EGae a) Se tele ee Cp oe sa) a 
wl) Gp 9 my) Qi ViGy js — 7) 


one obtains 


(2j,)'(2j + 1)! 
Ci + jo cme ats D' Ga ay + ij)! 


lis =Jr Vp — 


And with the phase convention (2.36) one obtains 


= = +4 F 2.46 


This equation gives all coefficients that occur on the right-hand side of Equation (2.40) 
for m = j. Equation (2.40—) then gives the values of 


tm m,|jm =j — 1) 
and successively the values of {m,m,|j m> for all m = j — 1,j — 2,..., —j. 
The recursion relation (2.40—) for fixed values m,, m,, j can be solved, 


and an expression of {m,m,|j m) in terms of {m,m,|jj> can be given. From 
this, using (2.46), a general formula for the Clebsch-Gordan coefficients 


? A derivation of (2.44) can be found in Edmonds (1957, Appendix 1), 
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Table 2.1 <j,m,3m.| jm) 


m, =4 m, = —4 

ej, +3 ij, t+m+4 jy —m+4 
OT aie | 2j, + 1 

5, = _ jit iy tm+4 
= 2j, + 1 2j, + 1 


can be derived. We shall not give this derivation here,* but give just one of 
the formulas for the Clebsch-Gordan coefficients: 


{iijim2|jm> = CisjiJ2m2|jm> 


A E pee ia Ui Sa I tS to 
m;,+m2,m = = 5 
G; +Jj. +5 + 1)! 


x [G1 + m)'!G — m)!G2 + m)!(j2 — m2)'G + m)!G — m)!]*/? 
«EC-V lzGs + ja J — DMA Im — 2)! +m = 2)! 
mj —Jo + My + 2)K] — Jt —m, + z)!}} (2.47) 


We also list some of the properties (symmetry relations) of the Clebsch- 
Gordan coefficients: 
as fee: reali” >. fae 
<jymyj2Mlisi2i3™s> = (146 es <j2 — M2Jj3mM3 linjsj1™1>- 
1 


(2.48) 


We now give explicit expressions for the Clebsch-Gordan coefficients for 
the cases j, = 0, 3, L. 
For j, = 0; 
<j,m,00|j m> = 9j,;Omim- (2.49) 
The values for j, = } and j, = | are given in Tables 2.1 and 2.2. 
Instead of the Clebsch-Gordan coefficients it is frequently more con- 


venient to use the Wigner 3-j symbols, since these display symmetry prop- 
erties more clearly. The 3-j symbol is defined by 


i ae <a apee 
= (- 1p mig + Io? ij, ee 2.50 
a mM, E) ( Ni ( 33 ) <jyMyj2M2/)3 m3» ( ) 


3 A derivation using the same notation as used here is given in Edmonds (1957, pp. 44 45). 
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Table 2.2 ¢ j,m,1m,|jm> 


j= m, = 1 iO m, = —1 
a, jomemGnt ma) Paes) (i = mG, — m+ 
a 2, + DQ; + 2) Qj, + DG, + D G+ NG, aa 
ji +m), — m+ 1) fe — mG, +m +1) 
"NV 3G FD a +1 2,0, + 1) 
eae Gi; — mG, — m+ 1) - f= +m) e +m+ IG, +m) 
a 2,2) + 1) FC: 7,2), + D 


Its symmetry properties are given by: 


Ji J2 J3 = J2 J3 di _ Js ji J2 (2.51) 
Mm, M, Ms, mM, Mz mM, Mm, Mm, Mz ; 


eareraroe! by) ame! | ae Jo hi ods 
[Wieder = 
ee (/" ( 


mz, Mm, My; 
Be i a eee 
m, mz; mM; oe Oe 


and 


i Ja Js oan ji Ja J3 
= (—1pitetis : 25 
Table 2.3* lists several values for the 3-j symbols from which many others 
can be calculated using the symmetry properties (2.51)-(2.53) 


V.3 Tensor Operators and the Wigner—Eckart Theorem 


We have considered the angular-momentum operators in some detail in the 
previous section. iNey we consider a more general class of operators, called 
tensor operators,’ which are defined by their relationship with the angular- 
momentum operators. The simplest example of such an operator is a scalar 
operator, which is defined to be any operator S satisfying 


ol 0: Gel) 
Another example is any set of three operators V, (i = 1, 2, 3) that satisfy 
[Ji, Vil = hey, V, eek — lh) (3.2) 


* From Edmonds (1957), with permission. 
° We shall actually only consider what are called “irreducible tensor operators.” 
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Such a set of operators is called a vector operator or regular tensor operator. 
We note that the angular momentum itself is a vector operator. 

Rather than using the “Cartesian” components V,, it is more convenient 
to use the “spherical” components Vo, V,., defined by 


dE | 
Vo = V3, Veise Wie + iV,). (3.3) 


Note that the H, of (11.3.1) (or the J, = hH,) are not the spherical com- 
ponents of a vector operator, but differ from them by the factor + 1 ie or 


—1 We - therefore we use the notation J,, for the spherical components 
of the angular momentum operator, in contrast to J, for the raising and 
lowering operators. Equation (3.2) becomes 


ew) 0 (x = —1,0, +1), (3.4a) 
(Je,Verd=JS2HVo (Je. Vol = VS 2hV21 os Yel = whVan. 
(3.4b) 


In general, we define a tensor operator of rank j to be (in spherical com- 
ponents) a set of 2j + 1 operators® TY) (kK = —j, ~j +L... + j) that 
satisfy 


[Jo, i = Kne, 
Ja, TH) = SiG + 1) — ee + DATA. (3.5) 


This can be written in a more compact form using the Clebsch-Gordan 
coefficients: 


[J TP] = SiG + Ddielulin + OTR, (3.5) 


We note that scalar and vector operators are tensor operators of rank 0 and 
1, respectively. 

The matrix elements of tensor operators have an important property which 
is expressed by the Wigner—Eckart theorem: 


Let T be a tensor operator. The matrix element of T”) between the angular 
momentum eigenstates may be written as 


Gjim' |TO|jm> = GmI Kljimy GTO), (3.6) 


where (jmJx|j'm'> is a Clebsch-Gordan coefficient and the symbol 
Cj | TO \|7> [defined by (3.6)] denotes a quantity that depends on j’, j, J and 


© We emphasize the use of spherical components by the use of Greek letters such as w in 
labeling the components. While such emphasis is not needed for the general tensor operator, 
in the case that the tensor operator is a vector operator the use of Greek letters will enable us 
to distinguish between the Cartesian components (Latin letters) and the spherical components 
(Greek letters). 
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the nature of the tensor operator T! but does not depend on m, m’, or kK. 
<j'||T2|7> is called the reduced matrix element’ of the tensor operator. 


This theorem will not be proven here, since the proof is pure mathematics 
and does not give any additional insight into the physics. If in addition to j 
and m there are other quantum numbers, say n = (a,, a2,..., Gy), then the 
reduced matrix element will in general also depend upon these quantum 
numbers, 1.€., 


<nijim |T2\njm> = GmI «lfm nj |TOlnd. (3.6') 


It should be emphasized that the Wigner—Eckart theorem is both a theorem 
and a definition. It is a theorem in that (3.6) says that the matrix element of 
T\’ can be factored in a certain way, and it is a definition in that (3.6) defines 
the reduced matrix elements. 

The Wigner-Eckart theorem has become one of the most important tools 
for the understanding of physics. Equation (3.6) is the Wigner—Eckart 
theorem for the rotation group, which is connected with the algebra of 
angular momenta J; as an algebra of observables. Many physical systems 
have an (enveloping) algebra of a group as a subalgebra of the algebra of 
observables. And they have observables that are tensor operators with 
respect to this group. For those observables the Wigner—Eckart theorem, 
a generalization of (3.6), holds. 

The Wigner-Eckart theorem expresses the matrix elements of the tensor 
operators, which are directly connected with the numbers measured in an 
experiment, in terms of Clebsch Gordan coefficients and reduced matrix 
elements. The Clebsch-Gordan coefficients are known mathematical 
quantities and are calculated from the properties of the group. The reduced 
matrix elements are physical parameters whose values have to be obtained 
from the experimental data. {If an observable has further additional proper- 
ties—e.g., Equation (Vf.3.12) of the Lenz vector in Chapter VI—then the 
reduced matrix elements can be reduced to a still smaller number of param- 
eters.] Thus the importance of the Wigner-Eckart theorem is that it allows 
one to fit the large number of experimentally observable matrix elements in 
terms of a much smaller number of more fundamental quantities, the reduced 
matrix elements. The property of being a tensor operator is often all one 
knows of an observable, and the Wigner~Eckart theorem then is the only 
tool at hand. 

A particular consequence of the Wigner—Eckart theorem (3.6) is 


Gim'|T|jm> = 0 
ifm’ AKk+m orj#AJ+j,34+j—1, (3.7) 
7 In some textbooks, a constant factor or function of j appears explicitly in the Wigner- 


Eckart theorem. In our notation these factors have been absorbed into the reduced matrix 
element. 
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which follows immediately from the properties of the Clebsch-Gordan 
coefficients (2.39). 


As an illustration of the Wigner-Eckart theorem, we note that for a scalar 
operator S, Equation (3.6) becomes 
G'm'|S|jm> = OnmdjiISIF>- (3.8) 


This says that S cannot change the angular-momentum quantum number, 
which is just as we would expect. For the angular-momentum operators 
themselves, Equation (3.6) gives 


<j'm'|J,|im> = imi xlj'm') <I. (3.9) 


where the Clebsch—Gordan coefficients are listed in Table 2.2. By comparing 
the Clebsch—Gordan coefficient in the table with the formula (III.3.8’), 
(II1.3.22), and (II1.3.23), we see that 


TID = 6 jh IG + VD. (3.10) 


If |jm> is a basis in the space in which J, is a general vector operator, then 
we can expand V,.|jm> with respect to this basis: 


Vlim> = ¥'|j'm'> <j'm' [Veli m) 
J'm' 


and obtain, according to the Wigner—Eckart theorem (3.6), 


Velim> = ) |j'm’> Gm 1 xlj'm') GV. an) 
jim’ 
If the c.s.c.o. contains, in addition to J* and J,, N other operators A,,..., Ay 
with spectrum 7 = (a,, 4>,..-., Gy), then the basis is |” jm), and instead of 
(3.11) one has, according to (3.6’), 
Veinjm> = > |n'j'm'>Gm1 lim) nV ind (3.11') 
a j'm’ 


We shall for the remainder of this section suppress the additional quantum 
numbers y, but it should be understood that whenever additional quantum 
numbers 7 are needed, the reduced matrix elements will depend upon these 
quantum numbers and summation over y’ is implied. 

According to (3.7), the only nonvanishing terms in (3.11) are those for 
which j’ =j + 1,j,j —landm’ =m+k: 


Veijm> =|j-—1m+ x) Gmiklj-Im+mgG— IVI) 
+ |jmt+KyGmikijm+ we GilViyp 
+itim+neoUmikljtim+ wo gt Vp. (12) 


This is the most general form possible for the action of a vector operator. 
According to (3.12), then, any vector operator can be completely specified 
by three reduced matrix elements (which in general may depend upon 
yn, n’). Actually, as we shall show below, only two quantities are needed to 
determine a vector operator. 
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By using the Clebsch—Gordan coefficients of Table 2.2, we can write (3.12) 
explicitly. For the 0-component, 


Yolim> =1j—1m(— [LER Gs — ayvayy 


m 
. VI; 
+ lim> oo) li> 
G—m+i)G+m+ 1) 
Qe hgh 


+14 Lm )o+ 11M tn. 


or 


Volim> = ./j? — mc;|j — 1m) — ma,|jmy — ./G + 1)? — md;|j + 1m), 
(3.13) 


where the c;, a;, and d; are defined by Equation (3.13) itself: 


Poe 
: JiQi + 1) 
iV 
—— (3.14) 
Se Neat 
= G+ IVI 


i ~ TGs e+ v 


Likewise, 


Vislim> = —|j-1m+1)/G—m-D)G-—m/2¢, 
+ |jun + 1>,/G + m+ DG — m)/2a; 


—lj+imt+1>/Gtm+)G+m+2/2d, (3.15) 
and 


V_slim> = —|j—1m—1)./G + mG +m — 1)2c, 
—|jm—- 1>./G - m+ 1)G + m)/2a; 
~|j+im—1>/G—m+)G—m+2)/2d;. (3.16) 


Equations (3.13), (3.15), and (3.16) say that a vector operator is completely 
determined by the three functions c;, a;, and d,; (which may in general 
depend upon 7’, 7) of the discrete parameter j. In the mathematical note 
below it will be shown that in fact only two such functions are needed, as 
it is possible to choose the basis |j m> in such a way that 


dj = Cj41, (3.17) 
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J 2j + 3G + INV ID = /27 + KNW + D. (3.17') 
[ We define a new basis 


|hin> = @(j)|j m), (3.18) 


where c(j) is a complex number. The angular-momentum operators 
have the same form (Section III.3) in the |h/,> basis as they do in 
the |j m> basis. In general, the |h/,> are not normalized unless 


(og) | =1. (3.19) 
In this basis, Equation (3.13) becomes 


Voll) = JP = mie, OO hh-t> — may 
= Ga, Danae (3.20) 


We define A; = a;, and we wish to choose w(j) such that 


ox) 0) 
=C; = / Di OCR e 1(5 74 
gad C; and Sr aria j ae ea) 
This will be possible if 
oj) oj ~ 1) 
= Le Beez 
Cj oj = 1) cj) d; 1 ( ) 
Lesa 
27 2f5 dj-1 
w*(j) = w*G — 1) ee (3.23) 
j 
Suppose jo is the smallest value of j in the space, ie., 
Gio — NIV llio> = 9. (3.24) 
Then (3.23) will be satisfied if we choose 
Nic a Gig digi | ja 
OU). Jo Cees, (3.25) 
so (3.20) becomes 


Volhi.> = Jj? — m?C,|hi *> — mA;|hy,) 
= JG + ly — m’C)4,|hit. (3.26) 
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It can be shown that the |h/,) can be normalized in the following 


two cases: 
1. For Hermitian V, (Vi = V_,)wehaveC,;= —C;. _ 
2. For skew-Hermitian V, (Vi = —V_,) we have C; = C;.] 


Thus every Hermitian or skew-Hermitian vector operator can be written 
in the form (3.13), (3.15), (3.16) with (3.17), where the c; are purely imaginary 
or real functions of j. It should be noted, however, that in general we may 
have d; = c;,, for only one vector operator V, at a time. If two different 
vector operators are involved in one problem and the basis has been chosen 
so that for the reduced matrix element of one of them (3.17) holds, then in 
this basis, the other or any additional vector operator is expressed in terms 
of three independent reduced matrix elements. 


Appendix to Section V.3— Representations of the 
Algebras &(SO(3, 1)), &(SO(4)), and &(E3) 


Algebras that are generated by the angular-momentum operators J, and a 
particular vector operator are needed for various problems of quantum 
physics. We have already seen an example of such an algebra in Section III.4, 
where &(E3) was the algebra generated by the angular momentum J, and 
the dipole operator Q;. In Chapter VI we shall encounter the algebra &(SO(4)) 
generated by the angular-momentum operator L; and the Lenz vector A,. 
é(SO(3,1)) is the algebra of the Lorentz group and has various applications 
in quantum physics. 

In this appendix we give a derivation of the representations of these 
three algebras. These algebras are the enveloping algebras of the groups 
$O(3,1) [the pseudo-orthogonal group in (3 + 1) dimensions], SO(4) [the 
orthogonal group in 4 dimensions] and E; [the three-dimensional Euclidean 
group], but we shall not discuss these group-theoretic connections here. 

The defining relations of this algebra are given by 


LH;, H;] = i€; i, Hy, (3.27) 
Geran = ijn Fy, (3.28) 
[F;, F,] = Miéipn Hy; (3.29 7} 


where A? = —I for &(SO(3,1)), 4? = +1 for &(SO(4)), and 42 = O for &(E3). 
Equation (3.27) defines the algebra &(SO(3)) of angular momentum. Equa- 
tion (3.28) specifies that F; is a vector operator with respect to &(SO(3)); 
the F;,i = 1, 2,3, are the Cartesian components. Equation (3.29) then 
specifies that this vector operator together with H, generates one of the 
above algebras. If in addition to (3.27) and (3.28), (3.29) is also fulfilled, then 
the F; are very particular vector operators and its reduced matrix elements 
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have very particular values. We shall now determine these values. We shall 
derive the representations of (3.27)-(3.29) for 47 = —1 and obtain the other 
two cases A? = 1 and A* = 0 by achange of the Hermiticity property and by 
a limiting process, respectively. 

To find all linear representations of the algebra (3.27)-(3.29) means to 
find all linear operators in all linear spaces that fulfill these commutation 
relations. We shall restrict ourselves here to the subclass of those representa- 
tions whose representation space @ contains each of the representation 
spaces # (I = 0, or 4, or etc of &(SO(3)) at most once.® In each #! we have, 
according to Section III.3, the basis 


fn m= —l,—-14+1,..., +1, (3.30) 


and the space # is then spanned by these basis vectors f!,, where / runs 
through a set of values that is to be determined. In other words, 


R=YVO@QR. (3.31) 
I 


If one of the #', say for | = I), were to appear more than once, then f/, 
would not be a basis system of the space &, and a new label (quantum 
number) would be needed to distinguish orthogonal vectors with the same 
value |, and m: f 0". 

The action of the linear operators H, on all f!, is already known from 
Section III.3, and we have now to determine the action of the F, upon the 
f},. We introduce the components [in analogy to (III.3.1)] 


Fo=F;3, Fy, =F, + iF, = + 2F 4. (3.32) 


F, and F,, are the standard spherical components of the vector operator 
F. The commutation relations of the Fp, F, with H., Ho = H; and with 
each other are then 


[F,.. H,] = 90 i Ue cree 
7 (3.33) 
[Ho, Fi] = +F i, [H., Fo] = FF. 
Woes ae | za + Hy eee Fe) = =2LHy. (3.34) 


Equation (3.33) corresponds to (3.28) and expresses the vector-operator 
property of F, and (3.34) corresponds to (3.29) with A? = —1 and specifies 
that this vector operator is a generator of &(SO(3,1)). We now utilize the 
vector-operator property and use the Wigner—Eckart theorem; then we 
obtain for Fy and F, the expressions (3.13), (3.15), and (3.16). If we also 
choose the phase factor of the basis vectors appropriately, as discussed in 
the mathematical note just before this appendix, then (3.17) can be used, 


8 These include all the representations that are connected with unitary representations of 
the groups SO(3,1) and SO(4). 
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and we obtain for the action of Fy and F, on the basis vectors the expres- 
sions 


Fofn = JP — me fe | — mash, — JC + IP —meaife', (3.350) 
Fi fn = JU —m(l-m— Defi —  /U= m+ m+ Dafne 

+ f/l+tm+ D0 + m+ Ves, fit, G35.) 
F_ fin = — JU + m+ m= Deft — (+ md - m+ Daf; 


jC AD CS meee. (3.35_) 


These have been obtained using only (3.33); and every vector operator, in a 
space in which &! appears at most once, has this form. 
The vectors fj, have, according to the results of Section III.3, the property 


H3f,=mf, Hi, = K+ 1)fh, 
H. fin = JF mt m+ Dfbat- 


The f!, have in addition been chosen so that (3.17) is fulfilled, but, due to 
the redefinition (3.18), they are in general not normalized. Let us now see 
when the f/, can be normalized without destroying (3.35). This can be done 
when || f/,|| is independent of |. From (3.35,) one obtains 


(3.36) 


Pole w= = ec are (3.37) 
and 
ae ’, Fofi,) = V ?— me(fin - es (3.372) 
In order that || f/ll = fir tll @e., ||f5| be independent of J), one must 
have according to (3.37) 
ot m a fmlCr = —CO me ota lene (3.38) 
This can be fulfilled in two cases: 
1. Fo is Hermitian, Fh = Fo; then c, must be imaginary, c, = —c*; 
but then it follows by the same argument that a, = a*. 
2. Fo is skew-Hermitian, F} = — Fo; then c, must be real, c, = c*; but 
then it follows by the same argument that a, = —a¥. 


If Fo = F is Hermitian (skew-Hermitian) then by (3.28) all other com- 
ponents of F are also Hermitian (skew-Hermitian), and the spherical com- 
ponents fulfill F4, = F,(F, = —F;), in which case F is called a Hermitian 
(skew-Hermitian) vector operator. Thus the basis vectors f!, can be normal- 
ized in two cases: if F is a Hermitian or if F is a skew-Hermitian vector 
operator. 

So far we have only exploited (3.27) and (3.28); we shall now make use of 
(3.29) or (3.34) to determine the unknown coefficients a, and c, [related to 
the reduced matrix elements by (3.14)]. It is sufficient to use just one of the 


Tensor Operators and the Wigner-Eckart Theorem 149 


three relations (3.34) because the other two are consequences of it and the re- 
lations (3.33). We choose 


[F,,Fo] =H,. (3.34’) 


Applying both sides of (3.34') to the vector f/, using (3.35) and comparing 
coefficients of f1..,, fit,, and f1,,!, leads to the following equations: 


Reel) = (= haa le = 0, (3.39a) 
Lay+1@) + 2) — laje,,, = 0, (3.39b) 
(21 — 1)c? — (21 + 3)c?,, — a? = 1. (3.39c) 


One starts with an arbitrary basis vector f',, about which one assumes 
that it is contained in the particular representation space & that one wants 
to construct. Applying the operators H, to fj, a sufficient number of times, 
one obtains, as described in Section III.3, the whole space &'. Applying the 
vector operators F, to f!, a sufficient number of times, one reaches all the 
other 2’ < # where j differs from / by a positive integer, because F,, changes 
I by 0, +1, —1. Asj > 0, there must always exist a smallest value of j in any 
&R. This smallest j we call ko ; it can be any of the allowed values for j and will 
characterize the space &. If ky is the smallest value of &, then according to 
(3.35) 


Cko = 0. 


If k, is integer, then & contains only integer | > ko, and if kp is half-integer, 
then & contains only half-integer | > ky: 


a= > Ge. (3.40) 


l=ko,kot+1,... 


For all / for which c, # 0 one obtains from (3.39a) 


a(l + 1) — (I — la,_, = 9, (3.41a) 
and for all / for which c,,, 4 0 one obtains from (3.39b) 
4;4,( + 2) — la, = 0. (3.41b) 
Defining 
pr le Day, (3.42) 
these two conditions lead to 
ne a, (3.43) 
Pi+1 — Pr = 9, 


which means that p, is independent of |. We call this arbitrary complex 
constant p, = ikgc, where c is arbitrary complex, and we shall show below 
that the factoring out of ky is always possible. a, can then be written 


ikoc 


as d+) (c arbitrary complex). (3.44) 


a, 


150 Addition of Angular Momenta—The Wigner—Eckart Theorem 


For the case that both c, = 0 and c,,, = 0, it follows from (3.39c) that 
a; = —1, and the matrix elements (3.35) of F,, are completely determined. 
With (3.44) this means that ky = l,c = 1+ 1 and that @ = &. 

It remains to show that the factoring out of ky is always possible, i.e., 
that a, = 0 for ky = 0. But for ky = 0, (3.41a) holds for | = 1 and leads to 
a, = 0. For / = 2 this in turn leads to a, -3 — a, = a, -3 = 0, and proceed- 
ing in this way we obtain a, = 0 for / = 1,2,3,.... For | = 0 we are free to 
choose dy = 0, as then according to (3.35) the factor in front of ay is zero. 
Thus (3.44) holds generally. 

To determine c, we use (3.39c). Defining 


= (21 — 1)(21 + 1)c?, (3.45) 
we can write (3.39c) as 
O, — O14, — (21 + 1a? = 21 + 1, (3.46) 
or using (3.44), 
1 1 
0, — 0141 = (2! AP 1) oa kee? € = wo): (3.47) 


We calculate now for any value k > ky 


k-1 1 
Oe le ie O41) = » Gl 1) — Kec a (p-aa) 


l=ko 1=ko l=ko Uae) 


I 1 


(k? — k3)(K? = c?) 


As ky is the smallest value of /, c,, = 0, and consequently o,, = 0, so that 
one obtains 


(k? — k3)(k? — c?) 


Cyt k2 3 


(3.48) 


and by (3.45), 


i (FP -De-2 
=i f a De nee (3.49) 


Therewith we have found all possible ways the operators H; and F;, that 
fulfill (3.27)—(3.29) act upon the vectors f,,. All these possibilities are charac- 
terized by two numbers (ky, c), where ko is 0,4, 1,3,...,¢ is any complex 
number, and the operators are given by (3.36) and (3.35) with (3.44) and 
(3.49). These are the linear representations of the algebra &(SO(3,1)) that 
contain a given value of the angular momentum / at most once. They will 
be, in general, infinite-dimensional, and only for certain values of c will 
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they be finite-dimensional. For an arbitrary complex value of c, c, will not 
fulfill conditions 1 and 2 below (3.38), the f!, will not be normalizable, and a 
scalar product cannot be introduced in &. 

We shall now find those representations of &(SO(3,1)) for which a scalar 
product can be defined in #. According to conditions 1 and 2, this can be 
done in two cases: 


1. For Hermitian vector operators a, must be real and c, purely imaginary. 
This means according to (3.44) that ic must be real, and according to (3.49) 
that (I? — kg)(I? — c*)/(2I — 1) must be positive. The latter is possible only 
when c? is real, i.e., when c is real or purely imaginary. 

Thus for any integer of half-integer value of ky and any purely imaginary 
value of c, condition 1 can be fulfilled. If c is real, then ky must be equal to 
zero in order that a, = aj‘. Then the expression under the square root in 
(3.49) becomes /?(/? — c?)/(4l? — 1), which will be positive for all | = 
Or 12-5. only ie; =< 1. 

Thus we have seen that condition 1 can be fulfilled in two cases: 


a. Fork, =*0, 4, 1,... atid ic = real, —co < ice < +00 
Db bors, 0, 0 = c < 1. 


In neither of these two cases can c,;,, ever become zero. Consequently, 
according to (3.35), one obtains from any given f!, also an f.,"'. One can 
apply F,. an arbitrary number of times arriving, at ever higher values for 1. 
The representation space # = Ako, c) with (ko, c) fulfilling case a or b is 
infinite-dimensional: 


oO 


Mk, = YL OR. (3.50) 


l=ko,ko+1,-.. 


It is the (algebraic)? direct sum of the irreducible representation spaces 
A (l=ko,k + 1,...) of the algebra of angular momentum, 6(SO(3)). 

2. For skew-Hermitian vector operators according to conditions 2, a, 
must be purely imaginary; this means that c must be real. And c, must be 
real, which means according to (3.49) that /? < c?. Thus the possible values 
of | for a representation space &(ky, c) with c real are bounded, and con- 
sequently there must be a highest value of /, which we call k,. According to 
(3.35) this means that c,, , , = 0, because only then would successive applica- 
tion of F, not lead to higher values of /. This in turn means, according to 
(3.49), that (k,; + 1)? =c? and c = +(k, + 1). Thus we have seen that 
condition 2 can be fulfilled for 


ko =0,4,1,... and c= +(k, +1) where ky = ko, ky + 1,... 
(3.51) 


° It can be completed with respect to various topologies, c.g., with respect to the Hilbert- 
space topology defined by the scalar product, or with respect to the topology defined by the 
countable number of scalar products (@, W/), = (¢, (H? + F* + 1)"W) leading to the nuclear 
space ®. Cf. Bohm [1978]. 
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The representation space 2 = A(ky, +(k, + 1)) is finite-dimensional and 
given by 


ky 
Mk), thi+ I= Y COM (3.52) 
l=ko, kot+1,... 

Though A(ky, (k,; + 1)) and A(ky, —(k, + 1)) are the same direct sum of 
irreducible representation spaces of &(SO(3)), the F,, act differently in these 
two spaces, as can be seen from (3.44) and (3.35). They are inequivalent repre- 
sentation spaces of &(SO(3, 1)) -with the same reduction with respect to 
&(SO(3)). 


Summarizing our results, we have found that the irreducible-representa- 
tion!® spaces of &(SO(3,1)) [the algebra defined by (3.27), (3.28), (3.29,2— — 1), 
for which H, and F; are Hermitian] are characterized by two numbers 
(ky, c) that can take any of the values in case a or b. The reduction of this 
representation space A(ky, c) with respect to the algebra of &(SO(3)) is 
given by (3.50), and the operators H, and F,, are given by (3.36) and (3.35) 
with (3.44) and (3.49). 

The irreducible representation spaces of €(SO(3,1)) with skew-Hermitian 
F, are characterized by the two numbers (ky, c) = (ko, +(k, + 1)) which 
can take any of the values (3.51). The reduction of this representation space 
with respect to &(SO(3)) is given by (3.52), and H, and F,, are again given 
by (3.36), (3.35), (3.44), (3.49). 

It is now easy to obtain the representations of &(SO(4)) and 6(£;). To 
obtain the representations of &(SO(4)) we define 


A,=-iF, A,=—iF, (*«=0,+,—). (3.53) 


If F, fulfills the commutation relation (3.29,.- _,) then A; fulfills the com- 
mutation relation (3.29,2- ,,), Le., it fulfills together with H; the commuta- 
tion relations of &(SO(4)): 


[A;, Aj] = i€ jj Hy, (3:29 22) 


If F; are skew-Hermitian (F{ = —F,), then A, are Hermitian (At = A)). 
Therefore we conclude that the Hermitian irreducible representations of 
&(SO(4)) are characterized by two numbers (ky, +(k,; + 1)) that can take 
the values (3.51). The irreducible-representation spaces are all finite-dimen- 
sional and given by (3.52). The operators A, are again given by (3.35) with 
(3.44) and (3.49), where F,, is given by (3.53). 

The algebra &(SO(3,1)) has two invariant operators, i.e., two operators 
that commute with all generators H; and with F;. They are 


CG. — i C,=F-H. (3.54) 


‘ The irreducible representations are also called ladder representations because they are 
obtained by climbing from one value of / to the next. 
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As they commute with all H, and F, their eigenvalues cannot change by 
applying H,, F,, to the corresponding eigenvector. Therefore their eigen- 
values must be the same in the whole irreducible representation space. One 
can calculate these eigenvalues applying C, and C, to a suitable chosen 
fi, using (3.36), (3.35) with (3.44) and (3.49). The result is 


Ci finlko, ¢) = (—k5 — c? + 1) fhilko, ©), (3.55) 
Crfinlko, C) = ikocf (ko, ©). (3.553) 

The two invariant operators for 6(SO(4)) are 
Oe = At + He, OAH (3.56) 


with the eigenvalues 


CP finlkos £(ky + 1) = (ko +c? — Df ilko, (ki + 1) 
= (ko + ki + 2k) fn(ko, #(k1 + 1), (571) 


Cr fits: +(k, + I)= koch (ko, (Ay tL) 
= tkolk, + I) filko, +(k1 + 1)). (3.57,) 


Thus, the eigenvalues of the invariant operators specify the irreducible 
representation completely; it is, however, more convenient to characterize 
the irreducible representations by the values of (ky, c) or by the values of 
(ko, k,, +), where + refers to the two inequivalent representations with 
the same value of ky and k,. 

We finally mention the special case where ky = 0 or C, = 0. This is the 
case one encounters most frequently in the application to simple quantum- 
mechanical systems. We shall meet this case for &(SO(4)) again in Chapter 
VI in the derivation of the hydrogen spectrum. It is the case that does not 
require parity doubling, as we shall see below in Section V.4 [in particular 
Equation (4.25)]. We have encountered it already in Section III.4; it leads 
to a spin spectrum given by (III.4.2) and depicted by Figure II.4.1. For 
ky = 0 it follows from (3.50) for Hermitian representations of &(SO(3,1)) 
that 

A200: — > OX, (3.58) 


1= 0,1)... 


and from (3.52) for Hermitian representations of é(SO(4)) that 


n~1 
ROk, +1=n= > 2 Ciestlee, 3, 22) (3.59) 
t=0,1,... 
From (3.44) follows that for this case 
a,=0 forall! (or <I||F ||) = 0) (3.60) 


and the F, and A, change the value of | by +1. Therefore, the representa- 
tions (0, k, + 1) and (0, —k, — 1) are equivalent. 
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We shall now derive the irreducible representations of &(£3) from the 
irreducible representations (ky, c) of &(SO(3,1)) by a limiting process, 
called Inénii-Wigner contraction. We define the operators 


P, = AF ;, J; = H,. (3.61) 


If H; and F; fulfill the commutation relations (3.27, (3.28), (3.29,2__,) of 
&(SO(3,1)), then P; and J; fulfill the commutation relations 


(J3. Jc) =e; (3.62) 
[J,, Pp] = tex Pr, (3.63) 
Wee) ave en, (3.64) 
and P; depends upon A. The invariant operators are now 
A6C, — Ppa) oe (3.65) 
AC, = P,J;. (3.66) 


In the limit A* + 0 the commutation relation (3.64) goes into the commuta- 
tion relation 


[P;, P,] = 0. (3.67) 


(3.62), (3.63), (3.67) are the commutation relations that define the algebra 
&(E,). Thus in the limit A? > 0 the commutation relations of &(SO(3,1)) 
go into those of &(E3). However, if we simply take the limit A > 0, then we 
see from (3.61) that P; + 0. In order to obtain the algebra of operators of 
é(E;) from the algebra of operators &(SO(3,1)) in A(ky, c) by the limit 
A + 0, we must simultaneously increase the F,, so that P, does not go to 
zero. This can be done, as seen from (3.35), (3.44) and (3.49), by increasing 
the value of |ic| > oo when A > 0. The contraction in the representation 
is therefore performed in the following way: 


A>0 |ic| + co such that icA > ¢€ (3.68) 


where ¢ is a finite real value. From (3.35), (3.44), and (3.49) we see that e.g. 
P, is then given by 


P3f, = lim AF of}, 


icA>e 


= /P — me fi! — maf, — J+ 2? — m?e4,f'-', (3.69) 


oe _ bt (PRP =e) 3 (PH 
é, = lim Ac, = lim f Ae = =e ea (3.70) 


= Aicky __ikge 
d+) +17 


where 


and 


a, = lim Aa, = li 


(3.71) 
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The invariant operators of &(£3) are P;P; and P,J;. From (3.65) we see that 
in this limiting process 


P;P; = lim(A2C, SF aed J) = lim( — kA? = A*c* + At) 4. 7 ca eb = ee. 


(312) 


where we have used (3.55,) for the irreducible representation (kg, c). In the 
same way it follows from (3.66), using (3.55,), that 


I eae — lim Ac2 = lim hick, = ky. (3.73) 


The representation of &(£3) obtained in this contraction process (3.68) is 
therefore characterized by the two parameters (ky, €) with — 00 <€ < +00. 

The &(SO(3)) subalgebra and the reduction with respect to this sub- 
algebra are not affected by the contraction process (3.68). Therefore the 
irreducible representation space A(ky, 6) of 6(E3) obtained from the rep- 
resentation space A(ko, c) is again given by [cf. (3.50) ] 


Rko, 2) = » ® #. (3.74) 
l=k3,k3+1,... 
Summarizing our results, we have found that the Hermitian irreducible- 
representation spaces of 4(E;), defined by (3.62), (3.63), (3.64), are charac- 
terized by two numbers (kg €) which take the values ky = 0,4, 1,..., —00 < 
€ < +0. The reduction of this space with respect to &(SO(3)) is given by 
(3.74), and the operators J, = H,, and P,, are given by (3.36) and (3.35) 
with F,. replaced by P,. and a,, c, replaced by G,, and ¢, given in (3.70) and 
Cr) 
The basis system f!, of the representation space A(ky, ©) is the one in 
which the following complete system of commuting operators is diagonal: 


Paeely 7; Eph), P;4;. (3:79) 
As the P; commute [Equation (3.67)], one usually chooses a basis system 


of the representation space in which the P; are diagonal. As a complete 
system of commuting operators one chooses therefore 


| rg end ne are (3.76) 
with the corresponding basis vectors |p;, ky >, which have the property 
Cko Pi |Piko> = Skok, 5 (P — P’), (377) 
P;|piko> = PilPiko> P;J;|Piko> = koe|Piko>. (3.78) 
€? = PiDj. 


Without a derivation,'! we mention that the transformation matrices 


from the basis f/,(kg, €) to the basis | p;ko>, 
IPiko> = Y Snlko, <€kKolm|piko>, (3.79) 
I,m 


11 A derivation is given in Appendix I of A. Bohm, R. B. Teese, Spectrum Generating Group 
of the Symmetric Top Molecule, J. Math. Phys. 17, 94 (1976). 
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are given by 


att Dials 8 — 4) (3.80) 
4x |e| 
Here (¢, 0) are the spherical coordinates of p; p, = |e|sin @ cos ¢, p2 = 
|e| sin 0 sin , p3 = |€| cos 0; and D},,,(, 0, —¢) is the rotation matrix.’? 
In this appendix we have derived a huge class of irreducible representa- 
tions of 6(SO(4)), €&(SO@G,1)), and &(E;), many of which are used for ap- 
plications in this book or for applications to other problems in quantum 
physics. We have used here the same method that was used in Section III.3 
for the derivation of the representation of &(SO(3)). Though group theory 
has not been introduced, we want to mention that these are all the unitary 
representations of the groups SO(4), SO(3,1) (the homogeneous Lorentz 
group), and E, (the Euclidean group in three dimensions). 


<ékglm|piko> = 


V.4 Parity 


Parity is a very important observable in quantum physics, as all quantum- 
mechanical systems are usually in states that are eigenstates of parity. The 
parity operation P, which is also called space inversion, is the operation of 
taking the mirror image. If a physical object has the position x; and momen- 
tum p;, then its mirror image has the position x*® = —x,; and momentum 
p® = —p;. For a quantum physical system this transformation P is, ac- 
cording to the basis assumption I, represented by a linear operator in the 
space of physical states. We call this operator the parity operator Up. 

Some properties of Up follow directly from the physical interpretation 
of the parity P; others are convention. As we shall justify below, Up is re- 
quired to fulfill the following defining relations: 


Up9;Up'=-Q, (i= 1,2, 3), (4.1) 
Up Us = eG = (4.2) 
Und, Us a dy (4.3) 


where Q;, P;, and J; are the position, momentum, and angular-momentum 
operators, respectively. A further requirement on U>p is 


Us ORO | (4.4) 
and 
Ue (4.5) 
A consequence of (4.4) and (4.5) is 
UL =e, Ui aU a, Ue (4.6) 


'? The rotation matrices can be found in, e.g., L. C. Biedenharn and J. D. Louck [1979], 
Chapter 3. 
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Linear operators that fulfill (4.4) are called unitary operators;'* thus Up is 
unitary and according to (4.6) also Hermitian. 

Equation (4.3) follows from (4.1) and (4.2) if J; = L; = €;,.Q;P,; for other 
angular momenta it is a postulate. 

From (4.1), (4.2), and (4.3) we obtain the relation of Up with any ob- 
servable that is a function of momentum, position, and angular momentum. 
When new independent observables are introduced, their relations with Up 
will have to be postulated. 


The parity provides a classification of tensor operators. A tensor operator 
T™ for which 


tet Up (tyr (4.7) 
is called a (proper) tensor operator; if 
Ces = =(-1 IY, (4.8) 


then T“ is called a pseudotensor operator. Thus the operators Q; and P; are, 

according to (4.1) and (4.2), proper vector operators, whereas the operator 

J; is a pseudovector (also called an axial-vector) operator, according to (4.3). 
All energy operators that we have met so far fulfill the condition 


UpHU;'! = H; (4.9) 


ie., they are proper scalar operators. Equation (4.9) is, however, not uni- 
versally fulfilled; the part of the energy operator that causes the weak decay 
is known not to fulfill (4.9) (parity nonconservation). 
In the following we shall give some justification of the above postulates: 
Let w be a vector in the space of states #, and denote by W* the vector we 
obtain from wy by applying Up: 


wR = Up. (4.10) 


Let us consider the pure physical state of which yw is the representative 
vector, i.e., the state A,, which is the projector on the one-dimensional 
space spanned by w. Let us denote by Aj the projector on the space spanned 
by the w®, and let us make the “physically reasonable” assumption that 
this is again a pure state (i.e., the y* span a one-dimensional space), w® is one 
of the vectors that represent the state obtained from A, by a space reflection, 
because Up is an operator that represents this space reflection. Thus AY 1s 


the space-reflected state of A,, and these two states are connected by 
AR = |P®>CWR| = Uplp><wlUb = UpA,U}. (4.11) 


As a generalization of (4.11) we obtain (cf. (11.4.31)] that for any arbitrary 
state W, 


w® = UpWu} (4.12) 


13 See also Section XV.3 for the definition of a unitary operator. 
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will describe the state that is obtained by a space reflection of the state W. 
When W is normalized (Tr W = 1), the space-reflected state should also 
be normalized (Tr W® = 1), or more generally, 


Tr W® = Tr W, (4.13) 
1.€., space reflection should not change the probability. Thus 
TrW =Tr(U,wvys) = Irw ULU,), 


which is fulfilled if one requires (4.4). 
If one takes the mirror image ‘of an object two times, then one gets the 
original object back. Therefore 
W=(W*)* = U,W®U] = U,U,WUBUs. (4.14) 


This is fulfilled if one requires (4.5), which is sufficient to ensure (4.14) but 
not necessary for (4.14); ie., (4.5) is also partially a convention. 
Let x; be the expectation value of the position operator of a quantum- 
mechanical system in the state W, 
x; = 1r(Q;W); 


and let xf be the expectation value in the state Wer of the quantum- 
mechanical system obtained from the state W by space reflection, W® = 
Cao. 
xf = Tr(Q;W*). 

Then, because of the physical meaning of space inversion, 

=o (4.15) 
1.€., space reflection changes the value of xX; into the value —x,. Thus we 
should have 

Tr(Q;W)= —Tr(Q;W*) = —Tr(Q,UpWU}) 
= —Tr(ULO;UpW). (4.16) 


This is fulfilled if 


Q; = UbQ; Up, 
or, using (4.4), if 


UpQ,Up' a Or 


which is the requirement (4.1). The other defining relations (4.2) and (4.3) 
can be justified by similar arguments. 

We have therefore justified the defining relations (4.1)- (4.5) of the operator 
that represents the observable parity by showing that these relations lead 
to consequences that we expect of the physical operation of space reflection. 
Thus U, defined by (4.1)-(4.5) can represent the observable parity. We have 
not, however, shown that this Up is the only operator that can represent 
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parity. (A few remarks concerning this question are given in Section XIX.2, 
in particular in the Appendix of that section.) 

We shall now study the properties of the parity operator Up in angular- 
momentum eigenstates—for example, as they occur in the rotator space 
(IIT.4.2): 


R= OR. (4.17) 


As the system of commuting observables 
a feet (4.18) 
commutes with U>, it follows that 
teen (4.19) 


is also a system of commuting observables. We have to distinguish two cases: 
(a) that (4.18) is already a c.s.c.o.—as is the case for the rotator—and (b) 
that (4.19) is the c.s.c.o. This latter case is called parity doubling. 

Case (a): If (4.18) is the c.s.c.o., then |jj3> is a basis system. Because of 
(4.3) it follows that 


Uplijs> = Rijs)» ~ (4.20) 


[x # 0 because of (4.4)], ie., |jj3> are eigenvectors of Up. The eigenvalue 
nm = n(j,j3), which we call the parity of the state |jj3;>, could in general 
depend upon j, j;. We will now show that 


mj, js) = (—1)'n, (4.21) 


where n, called the intrinsic parity, is independent of j and j,;, and where 
nla 
From (III.3.22) and (III.3.23) it follows that'* 


UpJ+\iis> = Up/iGi + 1) yaGa tL) fje et? 
= mj, is + DJG + 1) —saGs t Did is + D. 
Because of (4.3) the left-hand side of this must be equal to 
J,Upliis> = 1G, j3)J + iis? —, 
= m,is/iG + D — isis + Diiis + 1. 
Comparing the last two equations, we conclude that 


nj, js) = 2G, js + 1), (4.22) 


14 In this and succeeding sections we shall use units such that h = I. We shall on occasion 
restore the f’s to an equation, in which case we are using the usual (Gaussian) c.g.s. units. 
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Le., that 2 is independent of j,. To find the dependence of z upon j, we have 
to use (4.1) and the vector-operator property of Q;. Writing 


sa) ’ 0.29 om Q;, 
and using (3.12),we have 


OnJs7 = = Lis + egal — 1j, $e Op 
+ lijs + kK) Gislklijs + K> GIlQI/> 
+ ij + lige Ojsinyer 17; + eG + 1Olp>. (4:23) 
where ¢j'||Q||j> are the reduced matrix elements of Q. If we apply U, to both 
sides of (4.23), then from (4.1) it follows that 


Q,.Uplijs> = TMU)Qxliis> = —UrpQx\iis> 
= P= 1j5 + eng — YGiiky = js eG — Nel 
— Wis + KM) Gi3 1 klijs + KD GQ 
SU t ljs + eoag + DGjs lxlj Palys DG 10l7>- 
(4.24) 


If we insert (4.23) into the second term of this equation and compare it with 
the last term we conclude 


GIIQIi> = 9, (4.25) 
mj —1)= —aG) if G— NOM 9, (4.26) 
mj+1)= —n() if G+ 1/Ollp> #0. (4.27) 


According to (3.17’), 


V4 + 3G + QI = J27 + 1GIOli + 1, (4.28) 


so if <j|Q\|j + 1> = 0 for every j, then this and (4.25) would mean that OQ. 
is the zero operator. Thus for some value(s) of j, <j|/O||i + 1) # 0; for all 
such values we have by (4.26), (4.27) 


nj) = —n(j + 1). (4.29) 
From this it follows that 
mj) = (—1)n (f= const). (4.21) 
To determine what y can be we use (4.5): 
UpUplijs> = m@)nQ)lijs> = Tijs>, (4.30a) 


and (4.4): 
Gislllijs> = Gisl|UbUpliis> = RNG i3li3>. (4.30b) 
Thus 
nj)n(j) = 1, RU) (4.31) 
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and consequently z(j) = z(j), so z(j) is real and x(j)* = 1, ie., x(j) is +1 or 
is —1. Thus 


Hh=+1 or n= -1. (4.32) 
We can now give a complete justification for the selection rule (III.4.10): 
Ait. (4.33) 


It is a consequence of the fact that Q, is a proper vector operator. 
From the property 


[J x. Ot) = ttm Om (k, l,m = 1, 2, 3) (4.34) 
it follows that Aj = +1, 0 [see Equation (3.7)]. From (4.25), i.e., the property 
(4.1) of the Q; together with the assumption that there is no parity doubling, 
it follows that Aj # 0, which establishes ITI(4.10). 

Case (b): If (4.19) is the c.s.c.o., then the parity 2 is an additional label 
for the basis vectors: 

Uplijzs™> = 1\jJ3 ™). (4.35) 

The question now is what is the spectrum of Up, 1.e., what are the possible 


values of z. From (4.4) and (4.5) it follows in the same way as above that (4.31) 
must hold. Thus 


eel — — 1. (4.36) 
Instead of (4.23) we now have 


O,liism> = li — lis +k, -mGis 1 Klj—1js +g — 1, —allQiim 
+ |jj3 + K, —™MGj3 1 klijs + KG, —AI1Q| m 
+ lj+1j3+K, —mGjzl «lft lis; + eX + 1, —x/lQl x), 
(4.37) 
where in addition to the vector-operator property (4.34) of Q, we have 
taken (4.1) into account, i.e., that Q, changes the eigenvalue of Up, and also 
(4.36). Thus unlike case (a), (4.1) with (4.36) does not lead to any conditions 
on the reduced matrix elements <j’ 2'|\Q\|j > and parities. Consequently 
for every choice of j, j; there are two vectors 
fige—+1> and |fjzx7= —1>. (4.38) 


That is the reason case (b) is called the parity-doubling case. 
It is customary to obtain an analogy with case (a) by labeling the states 
not with z but rather with the intrinsic parity n, which is defined by 


Urliism> =n)" liisn> = (4 = £0). (4.39) 
The parity 7(j) in a state with angular momentum j and intrinsic parity 4 
is then given by 


mj) = (—1)n. (4.21) 
The selection rule is now seen from (4.37) to be 
Aji = 1,9, (4.40) 


An = —1. (4.41) 
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Equation (4.41), which is a consequence of (4.1), was also fulfilled in case (a), 
only there it was already implicit in the selection rule (4.33) as a consequence 
of (4.21). 

If parity is an independent observable (case (b)) then, according to (4.38), 
for every value of j we have two angular momentum spaces 


Ri and A. (4.42) 


As Q,. transforms from a given value of j” to (j + 1)”, j", and (j — 1)”, one 
obtains all angular-momentum spaces unless there is a j, such that 


<jo — I nllQllion> = 0 (4.43a) 
or aj, such that 


Gi + Enlil n> = 0. (4.43b) 


That is, jp is the smallest value of j that can be reached by repeated applica- 
tion of Q,, and j, is the largest. The space of all states may then be written 
Jt di 
R= yi OA-+1,0 2 ® #--1 = Ay=+1DA,--1- (4.44) 
melo! TG 

Are there physical systems in nature whose space of physical states is given 
by (4.44)? 

When we considered the rotating diatomic molecule, we used as its 
classical picture the dumbbell, ie., we assumed that the amount of inertia 
about the line joining the two atoms was zero. However, because there are 
a number of electrons revolving about the two nuclei, a better classical 
model in many cases is a dumbbell with a flywheel on its axis. Thus in many 
cases the diatomic molecule is not a simple quantum-mechanical rotator 
but a quantum-mechanical symmetric top; the total angular momentum 
jis no longer perpendicular to the direction of the figure axis a = x/|x], but 
instead has a constant component in the direction a, ie., 


j> a = Const, (4.45) 


due to the revolution of the electrons. 


Figure 4.1 Vector diagram for the symmetric top. The curved arrow 
indicates the rotation of the whole diagram about j. The dashed part of 
the figure gives the vector diagram when the sense of the direction of a 
is reversed. 
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A classical nonrelativistic symmetric top (Figure 4.1) is a physical system 
that has constant moments of inertia J,, [,, I, about its principal axes. It 
has angular velocity and angular momentum j, and the axis of symmetry 
lies along the direction of the unit vector a = x/|x]|. Its energy is given by 


E = 3)-@. (4.46) 
@ may be expressed in terms of j and a by using 
, I,-1 Pee 
j= I,0 + +—4 «.-jx—. (4.47) 
I, 4 : 


From Equations (4.46) and (4.47) one obtains the classical expression for 
the energy in terms of j and a: 


Lore I,—TI, 1 
E =—(y --— 5 4 :j)’]. 4.4 
oI, f Tae (x-j) (4.48) 
For the quantum-mechanical system, the numbers j; and x; are replaced 
by the operators J; and Q;. The energy operator of the quantum-mechanical 
symmetrical top is therefore given by 


een? ae te 
is 


I, (Q- 7) (4.49)'5 


The Q, and J; fulfill the commutation relations: 


[L;, L;] = ihe; , Ly [L;, Qj] = ihe; j.Qy LQ;, Q;] = 0 

(4.50) 

It has been shown in Appendix to Section V.3 that as a consequence 

of these commutations relations, which are identical with (3.62) 

(3.63) and (3.67) the following equations (identical to (3.72) and 

(3.73)) are fulfilled: 

Q’ = Q;0; = x” = const, 
Q-J = Qi = Jox, 
where x = +|x| and jy is the smallest angular momentum in &%, 
defined by (4.43a). The matrix element of Q; are given by (4.37) 


with the reduced matrix elements (determined from (3.14) (3.17) 
(3.70) and (3.71)) given by: 


Gin'Ollin> = Fi aN jane (4.52) 


(4.51) 


a 32.2 112. 
G = | n Oly n> al - ("| Onn (4.53) 
(ji + 1)? — jo)x? 
Ga lyey > 3) 


1/2 
G+ 1y'lQllind = -| Oy. (4.54) 


15 1 /Q? is the operator defined as the inverse of Q?:1/Q? = (Q’) '': see [M] in Section VL.3. 
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The y-dependence follows from (4.1) and (4.3). These reduced 
matrix elements, together with (4.37) and the usual matrix elements 
of J, [as given by (III.3.8’), (III.3.22), and (III.3.23)], constitute a 
representation in & of the enveloping algebra &(£;) of the three- 
dimensional Euclidean group extended by parity. When jy = 0, then 
<j\|Q\;> = 0 and the space is given by (4.17). Equation (4.54) shows 
that j can become arbitrarily large, i.e., j, = 00. 


The energy spectrum of the symmetrical top, i.e., the matrix element of 
the energy operator H in the basis | jj, n> of Z, is obtained from (4.49) using 
(4.51): 


1 ly al 
Spectrum —— 1 ee (4.55) 
ay I 


A 
G=A,A+4+1,A+4+2,...), (4.55a) 


where A = jo is the conventional notation used in molecular spectroscopy. 
The angular momentum spectrum (4.55a) follows from (4.44) or (3.74). 
It is also intuitively (classically) clear, as the value of total angular mo- 
mentum j must always be larger than its component A in the direction 
of Q. The component along the direction Q (axis of the molecule) can take 
the two values +A = jox/|x| (the — sign corresponding to the dashed 
line in Figure 4.1). As 


0;J;Up a —U,pQ,J;, (4.56) 


parity states are not eigenstates of the operator Q;J; that represents this 
component. Thus in parity eigenstates the molecule does not have definite 
values for the componeft of angular momentum along the molecular axis. 

From (4.55) it follows that the energy levels of the symmetric top, in which 
the component of angular momentum along the top axis is fixed, are the 
same as those of the simple rotator except that there is a shift of magnitude 
proportional to A’, and except that levels with j < A are absent. Each energy 
level has, in addition to the usual (2j + 1)-fold degeneracy, a twofold de- 
generacy because of the two possible values of 7 = +1. The energy diagram 
of this kind of symmetric top for the case A = | is shown in Figure 4.2, with 
that of the rotator for comparison. The + or — sign gives the values of the 
parity x. The levels witha given value ofj and = +,whichaccording to (4.55) 
should be degenerate, are drawn slightly separated. The selection rule 
Aj = +1,0 has already been derived. Thus for these kinds of molecules, in 
addition to the R and P branches given by (III.5.19), there also appear in the 
vibration-rotation bands transitions with Aj = 0, Le., a series of lines given 
by 


Wa) = 7 (B,, _ By )j ate (B,, = Bj (4.57) 


j 
5 - 5 ae 
St 4 es 
3 - 3 - 
2 + 2 + 
1 - 1 
0 ————- + (0)---------- 

(a) (b) 


Figure 4.2 Energy diagram (a) of the rotator and (b) of the symmetric 
top. For the symmetric top A = 1 is assumed. The dashed level with 

J = 0 therefore does not occur. [From G. Herzberg Vol. 1 (1966) 

with permission. ] 


Ft ie 


Figure 4.3 Energy level diagram for a band with P, Q, and R branches. 
For the sake of clarity, in the spectrogram below, the lines of the P and 
R branches which form a single series are represented by longer lines 
than those of the Q branch. The separation of the lines in the Q branch 
has been made somewhat too large in order that the lines might be 
drawn separately. The convergence in the P and R branches 1s frequently 
much more rapid than shown. [From Herzberg (1966), with permission. ] 
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(Here and also in the formula (III.5.19) for the R and P branches, vg is not 
the oscillator’s wave number but 


vo. = [a = T)/I 4] (By Ady = By Ady). 


But as A,,,, and A,,») are constant, this last term can be absorbed in a suitably 
redefined Ay.) This series of lines is called the Q-branch. The energy-level 
diagram and the transitions for the case that the upper rotational band has 
the value A = 1 and the lower band the value A = 0 are depicted in Figure 
4.3. 

Aj = 0 transitions are not observed in the infrared spectrum of diatomic 
molecules. Therefore A = 0 for the electronic ground state of the diatomic 
molecules, i.e., the motion of the electrons in the electronic ground states is 
such that no angular momentum about the interatomic axis results, so the 
diatomic molecule in the ground state is a dumbbell. However, all three 
branches have been observed in transitions between different electronic 
levels of diatomic molecules. Therefore in excited electronic states A is in 
general different from zero but has a constant value as long as the electronic 
state does not change. Thus in an excited electronic state, the diatomic 
molecule is a dumbbell with a flywheel rotating around its axis. Figure 4.4a 
shows an example of the energy levels and transitions between an electronic 


(a)— 


(s)+ 


(a)— 
(s)+ 


(a) (b) 


Figure 4.4 Energy-level diagram for the first lines of (a) a 'TI — 'X* 

transition and (b) a ‘TI — * transition. For the sake of clarity, the 

A-type doubling in the ‘TI state has been greatly exaggerated. The 

broken arrow to the left in (a) gives R(2)-Q(2), and the one to the right 

gives Q(3)—P(3). Their difference gives the sum of the A doublings for J = 2 and 
J = 3 in the upper state. [From Herzberg (1966), with permission] 
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state with the electronic angular-momentum component A = 0 and in- 
trinsic parity 7 = +1('Z*), and an electronic state with A = 1 ('ID). 
Figure 4.4b shows the same for an electronic state with A = 0 and n = 
—1('Z”) and the 'IT state. The splitting between the two “degenerate” 
levels of opposite parity, which was indicated in Figure 4.2, and which should 
not be there according to (4.55), does in fact occur. It is a consequence of 
the interaction between the electronic motion and the rotation of the molecule 
(A-type doubling), which was not taken into account in the derivation of 
(4.55). 

The transition frequencies given by (4.57) are in the visible or ultraviolet 
region, aS vo is in this region for transitions between different electronic 
states. 


Problems 


1. The wave numbers of the spectral lines corresponding to the transitions 
between two electronic states are given by 


Af) ee C9 | 
v= Var Vays 


Here the v‘ are the term values of the vibrating rotator in the electronic state (e) and 


nl 


are given by 
VO = T, + wn + 4) — w, x(n + 4)? + Oyen + 2)? + By + 1) 


where T, is a constant for a given electronic transition and the remainder follows as for 
Equation (III.5.33) the only difference being that the above formula includes a third 
order term in the vibration and no second order term for the rotation. The vibrational 
and rotational constants are different for different electronic states (e). In general the 
selection rule An = 1 does not hold for vibrational transitions between electronic 
states. 

Figure PS.1 is the spectrum of the emitted radiation obtained by exciting N, with a 
beam of Ne*. The spectrum contains transitions due to both N, and N;. The peaks 
labeled A and B represent transitions due to N;. Neglecting the rotational structure, 
determine which electronic and vibrational transitions peaks A and B correspond to 
by using the table of data on Nz (Table PS.1). 

(a) Locate the R and P branches and the null line. 

(b) What does the large peak in the rotational spectrum at about 4280 A represent? 

(c) Why does this rotational spectrum look significantly different from the CO 

spectrum in Figure III.1.5? 

(d) What are the two large peaks at about 4239 A and 4242 A? 

(e) Calculate the spectrum for this rotation band using Table PS.1, and determine 

which peaks in the rotation spectrum correspond to which /-values. 

(f) Compare your calculated spectrum to the spectrum measured from Figure 

PS.2. If there are any differences, explain why. 
(g) Why do the peaks in the rotation spectrum alternate in intensity in a 2:1 ratio? 
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Wavelength (A) 


7500 8000 8500 
Wavelength (A) 


Figure PS.1 Spectrum of N, and Nj obtained in N,—Ne* collisions, 
from J. R. White, dissertation, University of Texas at Austin (1971). 
with permission. Figure PS.2 is an expansion of the spectrum at B, 
which clearly shows the rotational band of N}. 
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4230 4240 4250 4260 4270 4280 
WAVELENGTH (A) 


Figure PS.2_ Expansion of spectrum of Nj at point B of Figure PS.1, 
from J. R. White (1971) with permission. 


2. Determine the energy levels of the (isotropic) three-dimensional harmonic 
oscillator and the degree of degeneracy of each energy level. Determine the possible 
values of the orbital angular momentum in the nth energy level. To do this, proceed 
as follows: 

(a) Use the basic assumption IV to obtain the space of physical states and the 
product basis from the combination of three one-dimensional oscillators. In 
this product basis the energy operator H = P2/(2m) + (mw? 2)Q? is diagonal. 
Use this fact to obtain the spectrum of H and the degeneracy of its eigenvalues. 

(b) Introduce the angular momentum operator L, = €:x.Q;P, and the cs.c.o. 
H, L?, and L;. This c.s.c.o. is not diagonal in the product basis obtained in (a). 
Introduce a basis of eigenvectors of the c.s.c.o. H, L?, and L, (the angular 
momentum basis), and determine the transition coefficients between the product 
basis and the angular-momentum basis. Determine the possible values of / 
for a given energy level E,,. 


CHAPTER VI 


Hydrogen Atom— 
The Quantum-Mechanical Kepler 
Problem 


The classical Kepler problem is described in Section VI.2, where the Lenz 
vector is introduced. Section VI.3 gives the algebra of angular momentum 
and the Lenz vector, and Section VI.4 describes the representation of this 
algebra. In Section VI.5 we present the algebraic derivation of the hydrogen 
spectrum and discuss, at the end, fine-structure effects. 


VI.1 Introduction 


An atom is described in everyday language as a nucleus with positive charge 
+Ze and Z electrons, each with negative charge —e, that move around the 
nucleus in fixed, stable orbits. The simplest atom 1s the hydrogen atom, which 
—in this language —consists of the proton as the nucleus and one electron 
moving around it in the field of the Coulomb electrostatic force of the proton. 
Thus the hydrogen atom is considered to be the realization of the quantum- 
mechanical analog of the classical Kepler problem. 

The quantum-mechanical Kepler problem was first solved by Pauli (in 
1925) using the Heisenberg commutation relations. Later it became custom- 
ary to treat the hydrogen atom as a solution of the Schrodinger equation 
for the 1/r potential, which was introduced in 1926. Our presentation is based 
on Pauli’s and Bargmann’s treatment.’ 


''W. Pauli: Z. Phys. 36:336, 1926; V. Bargmann: Z. Phys. 99:576, 1936. 
wal 
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To conjecture the algebra of observables for this problem we shall follow 
our general procedure for quantum-mechanical systems that have classical 
analogs and obtain the relations between the quantum-mechanical observ- 
ables by correspondence from the relations between the classical observables. 
We therefore recall some facts about the classical Kepler problem. 


VI.2 Classical Kepler Problem 


Let x; be the coordinates of the electron (with charge —e), and let p; be its 
momentum components. Let the proton (with charge +e) be located at 
the origin of the coordinate system. Then the Coulomb force between 
electron and proton is given by 


where 


Rex) cd hws ry 


and the potential energy (Coulomb potential) is given by 


2 


V(r) = — = 


[Here the charge e, the system constant, is measured in electrostatic units: 
e = 4.803 x 1071° esu; 1 esu = 1 cmdyn’/? = 1 g'/? cm?/? sec™1. If e is 
measured in the practical unit of charge, e = 1.602 x 10°! coulomb, then 
F is given by |F| = (4x€)(e?/r?) with fae = 8.99 x 10° kg m? sec”? C~ 7}. 
Therefore the energy of the electron is given by 


24 2 
= ae 
2, ae au) 
where m is the mass of the electron. [The electron mass m, = 9.11 x 1073! kg 
is much smaller than the proton mass m, = 1836m,; if one wants to take 
into account that m, is not infinite, then m in (2.1) is given by the reduced 
mass 


—the reduced momentum. ] 
Besides the energy E, there are two other constants of the motion: the 
angular momentum 


l=x x p, (2.2) 
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P 


Figure 2.1 


and the Lenz vector [or Runge-Lenz vector; cf. Barger and Olsson (1973) ] 


+—x, (2.3) 


The closed orbits of the classical Kepler problem are ellipses (Figure 2.1). 
] is an axial vector that is perpendicular to the plane of the orbit. As lis a 
constant of the motion, the orbit lies in some plane through O. For the 
Kepler problem, V = e7/r, not only is the plane of the orbit fixed, but the 
orbit itself is also fixed, 1.e., the major axis PA is fixed and the ellipse does not 
precess. The constant of motion that is used to characterize the orientation 
of the major axis in the orbital plane is the Lenz vector 4, which is directed 
along the major axis from O to A. 

The constant of motion 4 is a special feature of the 1/r potential. Whereas 
l is a constant of the motion for every spherically symmetric system [1.e., 
for V(x) = V(|x|)], 4 is a constant of the motion only for V(x) o I/r; a 
small deviation of the potential from this form causes the axis PA of the 
ellipse to precess slowly in the plane perpendicular to 1, so that the orbit is 
note closed. 

For later comparison with the quantum-mechanical case; we list here 
some relations between the classical constants of the motion.” Let a par- 
ticular orbit have the semimajor axis with length a, the semiminor axis 
with length b, and eccentricity € = (a? — b?)'/?/2. Then 


f= I? = me7a(1 — e?), 
2a 
4? = zs [Pie (2-6) (2.4) 

m 
Moreover, 

l-4=l1-a=0, 
where 

a. a/,/—2E. 


2 See, e.g., Barger and Olsson (1973), Chapters 4-3. 
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VI.3. Quantum-Mechanical Kepler Problem 


The quantum-mechanical observables of the quantum-mechanical Kepler 
problem for the hydrogen atom are obtained by replacing the numbers 
p;, X; in (2.1), (2.2), (2.3) with the Hermitian operators P; and Q; that fulfill the 
Heisenberg commutation relations (III.2.1). 

For the angular momentum this has been done before; the angular 
momentum operators are given by 


L, = €.0;P, (= 1,2. 3). (3.1) 


The energy operator for the hydrogen atom is conjectured from (2.1) to be 


i= P2 —e’Q-! Q = ./Q? + @3 + Q3, (3.2) 


2m 


where Q~! is the inverse of the operator Q. 


[The mathematical meaning of Q~' requires some explanation. 
As Q; are Hermitian, A = )’; Q? is a positive-definite operator. An 
operator A is positive definite if and only if (f, Af) =O for all 
f¢H. nm particular, A has the property that its spectrum {a} (all 
its eigenvalues) is nonnegative. For every Hermitian positive- 
definite operator A, one uniquely defines a positive-definite Hermit- 
ian operator B such that B? = A. B is written as B = \/A. It has 
the same system of eigenvectors as A, commutes with every operator 
that commutes with A, and has the eigenvalues {/a}. Often this 
definition of the operator JA is inadequate, and one must allow 
for eigenvalues { Se a 

The inverse operator B’* of an operator B is the operator that 
(if it exists) has the property that B-'B = BB"' = 1. If Bis a 
Hermitian operator with spectrum {b}, then the spectrum of B’ ' is 
{1/b}, and B~' has the same system of eigenvectors as B.] 


1 


To conjecture the quantum-mechanical Lenz vector from (2.3), we have 
to agree first what the quantum-mechanical correspondence of the numbers 
(p x 1); = ¢,p;l, should be. As the operators P; and L, do not commute, 
the operator €;;, P;L, is not Hermitian, but instead 


(€ijx Pj Ly)" a €;j¢ Ly P; = Eijx P,L, = 2ihP,. 


As we want the quantum-mechanical observables A; that correspond to the 
components of the Lenz vector to be Hermitian operators, we replace the 
classical quantity by the symmetrized product, 


il 
€ ijk Pj tk — Ei jK 3{P;, Ly}, 
where the anticommutator { A, B} of two operators A and B is defined by 


{A, B} = AB + BA. 
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Thus the quantum-mechanical Lenz vector is 


ae ol e°Q; 
Ay = 5 easlPre Ln) + 3.3) 


For the sake of convenience we shall now introduce some new units. So 
far the momentum P has been measured in eV/c, where c is the velocity of 
light, and the position Q has been measured in cm. The canonical commuta- 
tion relations read 


h 
LPs, 0;] = A 16;;, 


where h = 6.58 x 10718 eV sec = 1.97 x 107° eV cm/c with the velocity 
of light c = 2.998 x 10'° cm/sec. The Planck constant 2zh was the con- 
version factor between the measurement of energy E in units of eV and the 
measurement of energy v in units of sec” ' (frequency units): E = (2h)v = hv 
(Planck-Einstein relation); or it is the conversion factor between the mo- 
mentum in units of inverse length (cm~')—i.e., as the reciprocal of the de 
Broglie wavelength, 1/A--and the momentum in standard units (erg sec/cm): 
p = 2nh/A (the de Broglie relation). After we have convinced ourselves 
that frequency and energy or inverse wavelength and momentum are the 
same physical quantities measured in different units, it is no longer necessary 
to carry the conversion factor h between these different units.* Therefore we 
put A = | in all our calculations, which means that we replace the position 
Q measured in cm with the position Q/h measured in (eV/c)*. We call 
Q/h simply Q. Thus, the canonical commutation relations now read 


[P;,Q,] = —ild,;. (3.4) 


For the particular case of the electron in a hydrogen atom we simplify the 
notation further by using m, H as the energy operator, i.e., instead of P?/2m 
—e’/Q we use the energy operator measured in units of eV-m,, which we 
simply call H: 


2 fs 
H = - = O° where a= = = Ce: (3.5) 
The quantity 
e? 
o— 
he 


is the Sommerfeld fine-structure constant. In these units the Lenz vector is 
given by 


A, = t€u{P1, L,} + aQ;Q7*. (3.6) 


L 


3 That energy and momentum are also the same physical quantities measured in different 
units follows from relativity. 
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The commutation relation of the angular-momentum operators is given in 
these units by 


[dees Lj — ie; jx Ly; (3:7) 


which is immediately seen from (III.2.13). 

Now that we have conjectured the quantum-mechanical observables and 
thus the algebra of operators for the quantum-mechanical hydrogen-atom 
we want to derive the properties of this algebra and compare the results 
with the experimental data. In particular, we would like to know the 
spectrum of H, which will give us the energy levels of the hydrogen atom. It 
is clear that if we knew all algebras of operators in linear spaces generated 
by P;, Q; that fulfill (3.4), then the algebra of observables and space of states 
for the hydrogen atom would be one of them. For the hydrogen atom we 
need only one representation, i.e., one particular algebra of operators in one 
particular space of states. Instead of giving the precise mathematical speci- 
fications for this representation, we shall just go ahead and derive it. We 
remark that the space of physical states (subspace of the Hilbert space) for the 
hydrogen atom is different from the one for the three-dimensional oscillator, 
though the basic commutation relations (3.4) are the same. 

As in the case of the oscillator, the momentum P; and the position Q; of 
the electron are not physical observables, in the sense that the physical 
system (hydrogen atom) cannot be prepared in approximate eigenstates of 
P, or Q;; such a preparation would break the hydrogen atom apart. The 
hydrogen atom, like every nonrelativistic quantum-mechanical system, 
appears in eigenstates (pure or mixtures) of the energy operator. The energy 
operator (3.2), (3.5) of the hydrogen atom differs from that of the oscillator, 
which is of the form H,,. = P? + Q? [ef. (I1.2.1b)]. The space of physical 
states is the space spanned by the physically preparable states. For the 
oscillator these are eigenstates of H,,., and for the hydrogen atom they 
are eigenstates of H ih (3.5). Since the energy operators differ, the spaces of 
physical states differ. 

In the previously considered cases of the oscillator and rotator, new 
observables were introduced, as functions of the P; and Q;, which were more 
directly related to the physically preparable states (e.g., the L; for the rotator). 
After the relations between these new observables had been derived as 
consequences of (3.4), we could forget about the origin of these relations and 
consider them as new fundamental (defining) relations for the particular 
physical system under consideration. We shall follow the same procedure 
here for the hydrogen atom. 

In order to decide what to choose for these fundamental observables in 
the case of the hydrogen atom, we recall that a state vector can be an eigen- 
vector of two different operators only if these two operators commute. 
Physical states for stationary nonrelativistic quantum-mechanical systems 
appear to be in general eigenstates (or mixtures thereot) of the energy 
operator. Therefore we choose operators that commute with H as these new 
observables; these are the quantum-mechanical constants of the motion, as 
we shall discuss later when we consider time development (Chapter XII). 
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As |; and 4; are the classical constants of the motion, we expect that the 
angular-momentum operator L; and the Lenz operator A, will be constants 
of the motion, i.e., commute with H, and are therefore to be chosen as our 
new fundamental observables for the hydrogen atom. A straightforward 
calculation (Problem VI.1) shows that as a consequence of the Heisenberg 
commutation relations (3.4) it follows that indeed 


[H,L=0 and [H,A]=0. (3.8) 


As H commutes with L; and 4,, it will commute with the whole algebra 
generated by L; and A;. Consequently there is no operator in this algebra 
that transforms from a given eigenvector of H with eigenvalue E to a vector 
that is not an eigenvector of H with the same eigenvalue E. Let us denote the 
space of eigenvectors of H with eigenvalue E by &(E). Then all L;, A, and all 
(well-defined) functions A = A(L, A) of them transform a vector f € AE) into 
a vector Af = g that is again in A(E) with the same eigenvalue E. In other 
words, the algebra of L;, A; leaves the eigenspaces &(E) invariant. Let us 
therefore first consider this algebra and investigate the structure of the 
eigenspaces A&(E) of H. 

To identify this algebra better (and to see that it is a simple, well-known 
mathematical structure, we define a new set of operators 


A=) He ie A) ee (3.9) 


We have to ask ourselves whether the definition (3.9) makes sense. On the 
space A(E) there is no problem, because on &(E) / -2H = J —2E, i.e., 
./ —2H is a number, so on A(E) A; and A, differ just by a constant factor. 
Thus the restriction of A; to A(E), which we again call A;, is well defined. 
A, is also well defined wherever ./ —2H is well defined. This is the case for 
the space of vectors f for which (f, (—2H)/f) > 0, according to the definition 
of the square root and inverse of an operator. This space is called the negative- 
energy space, or space of bound states. We restrict ourselves to this space. 
[For the space where +2H is positive definite, one defines A; by Aj; = 
./ 2H A, and find that these 4; fulfill commutation relations that differ from 
those of A; by a factor of (—1), ie., A; and iA; fulfill the same commutation 
relations; see below. | 

We denote the negative-energy space by #. On &, A; given by (3.9) is well 
defined. From (3.8) it follows immediately that 


[H, Ad =0. (3.10) 


By a straightforward but lengthy calculation (Problem VI.1) one can show 
that as a consequence of the Heisenberg commutation relation (3.4) the 4; 
fulfill the following commutation relations with L; and with each other: 


[L;, A;] = 16: Ay, (3.11) 
[A;, A,] = 16; jy Ly A} = A;. (3.12) 
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With the L; and A; one defines the operators 
C, = A? + L? = A? + A2 + A? + Li + 3 + LG, (3.13) 
C,=A:-L=L-A=4A,L, + A,L, + A3L3. (3.14) 


These operators have the property that they commute with the A; and L; (and 
consequently with the algebra generated by A,, L,) if A; and L, fulfill the 
commutation relations (3.7), (3.11), (3.12): 


[C,, Lj] = 0, bOa A;] a 0, (3.15) 
[CL SOC Alo (3.16) 


The algebra generated by the L; and A; that fullfill the commutation rela- 
tions (3.7), (3.11), (3.12) is called &(SO(A4)). 


[It is the enveloping algebra of the four-dimensional rotation group 


SO(4). The algebra generated by L; and A; = wn A,, where 2H 
is positive definite, is the enveloping algebra &(SO(3, 1)) of the 
group SO(3, 1). cf Appendix to Section V.3.] 


The operators C, and C, are the invariant or Casimir operators of €(SO(4)). 
In an irreducible representation space of &(SO(4)), i.e., a space that is obtained 
by applying every element of &(SO(4)) to one vector (ladder representation), 
the operators C, and C, are multiples of the identity operator. That is, they 
each have only one eigenvalue, c, and c, respectively, and these values 
characterize the representation space [in the same way as the value j(j + 1) 
characterizes the representation space of &(SO(3))]. For the particular 
case where L; and A; are defined by (3.1), (3.6), (3.9), one can calculate (Prob- 
lem VI.1) that as a consequence of (3.4), 
Cr—0 2H )e- —d, (3.17) 
Cr (3.18) 
i.e., the operator C, is related to the energy operator (3.5), and C, is the zero 
operator. The relations (3.17) and (3.18) are the quantum-mechanical 
analogue of the classical relations (2.4) between a; and /;. Thus the space A(E) 
is a particular representation space of &(SO(4)), namely the one in which 
co = Oand c, = a*(—2E) ! —-1. 

If we know all possible representation spaces of &(SO(4)) that fulfill 
(3.18), then we know the properties of all the operators L;, A; given by (3.1), 
(3.3), (3.9) in the spaces A(E), for all possible values E with E < 0. It will 
turn out that c, cannot take any arbitrary real value, but rather that the 
spectrum of C, is discrete. [For the invariant operator J* of &(SO(3)) we 
found the same result, namely that only the discrete set of eigenvalues 
jG + 1) with j = 0, 3, 1, 3,... is possible.] Consequently from (3.17) it will 
follow that the spectrum of the energy operator H is discrete and the spectrum 
of H is obtained by (3.17) from the spectrum of C,. 

Our task is therefore to find all the representation spaces of &(SO(4)), 
in particular those that fulfill (3.18). This can be done in essentially the same 
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way as it was done for &(SO(3)) in Section III.4, except that the calculations 
for €&(SO(4)) are more involved. In the next section we shall give a description 
of the properties of the representation spaces of &(SO(4)) which should 
suffice for the understanding of the subsequent material. A derivation of the 
representations of &(SO(4)) has been given in the Mathematical Appendix 
to Section V.3. 


VI.4 Properties of the Algebra of Angular Momentum and the 
Lenz Vector + 


We choose as basis vectors in @(E) the angular-momentum vectors |/m). 
This is possible because 


H, L?, L, 


is a system of commuting operators. Whether this is a c.s.c.o. for the hydrogen 
atom can only be decided by comparison with experiment. Therefore we 
make the assumption that this is a c.s.c.o.—i.e., that |, m and the energy are 
the only quantum numbers of the hydrogen atom—and we shall later see 
that this is true to a high degree of accuracy (neglecting spin effects). Thus 
|! m> is a basis system in @(E). Since the A; are proper vector operators with 
respect to the angular momentum, as stated by (3.11) and U,A;U, = —A;j, 
it follows from the Wigner—Eckart theorem that 


A,|im> = |l-1m+x><Im1x|l—1m+k)><1—- 1] Al]D 
+|{lm+k><m1xllm+ K><IA||D 
+|l+1m+x><Imik|l+1m+xK><l4+ LAID. (4.1 


Since |!m> has been assumed to be a complete basis system, there is no 
parity doubling, and since A; is a proper vector operator, it follows as in 
(V.4.25) that the reduced matrix element </||A||J) = 0. [This can also be 
seen to be a direct consequence of (3.18), which is related to the U, transfor- 
mation property of A- J]. If we insert into (4.1) the explicit values of the 
Clebsch-Gordon coefficients we obtain that as a consequence of (3.11) 
and (3.18), A, is given by 


Ao|im> =/P — m?C,|l — 1m) — /( + 1? — m?C,,,|1+ 1m, 
(4.25) 


A,,|lm> = —\/(1— mI — m — 1)/2C,{1- 1m +4 1) 
— J+ mt Il + m+ 2)/2C4,|1+ 1m + 1), (4.2,) 


A_,|Im> = —./U + m1 + m — 1)/2C,|l— 1m — 1) 
— J(l— m+ Il — m + 2)/2C,,,|1 + 1m — 1), (4.2_) 


where the C, are as yet undetermined functions of the discrete variable 1, 
related to </ — 1||A||/> by Equation (V.3.14). 


4 For a derivation see Mathematical Appendix V.3. 
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The C, are determined by (3.12). We state the result: For every natural 
number n = 1, 2,3,..., there is a function C(” such that (3.12) is fulfilled. 
This function is 


ne =i 


cy" =i 4/2 — 1? 


Cy 0) (4.3) 
From (4.2) and (4.3) one finds the following properties: For every number 
n= 1,2,3,..., there is an irreducible representation space A(n). The lowest 
angular momentum in A(n) is | = 0, because starting from a given value | 
by application of A, one can always reach the value / — 1, according to 
(4.2) and (4.3), unless 1 = 0, because then C% = 0. The highest angular 
momentum is n — 1, because according to (4.2) one can always reach the 
value 1 + 1 from a given value / by applying A,, unless C”) , = 0—which, 
however, is the case according to (4.3) for | = n — 1. As the |! m> are ortho- 
gonal for different values of |, the space &(n) is the orthogonal direct sum of 
the irreducible representation spaces #' of angular momentum: 
n—1 
Rn) == YO, (4.4) 


&(SO(3))  1=0 


where the symbol = is to indicate that the two spaces are the same when only 
the algebra &(SO(3)) acts in them. 

The action of L,, Ly in each of the spaces #! and therefore in #(n) has 
been derived in Section III.3. In particular, 


L3|[m) =m|lm) — L2|I my = 1 + Im) (4.5) 


Thus L; and A;, and consequently the whole algebra &(SO(4)), is known in 
each A(n)(n = 1, 2, 3,...). To indicate that the vector |/ m)> is an element of 
A(n), i.e., that A,, is given by (4.2) with the particular C{”’, we label it with this 
number: |n!m). A straightforward calculation, using (4.2) and (4.3), gives 
that the eigenvalue of the operator C, in the space A(n) isc, = n? — 1: 


C,|nIlm> = (n? — 1)|[nI m). (4.6) 


This gives the set of discrete numbers that are possible for c,. 
The space A&(n) is not only an irreducible representation space of &(SO(4)), 
but also of &(SO(4)) extended by parity Up: 


Unk; Up = +L, (4.7) 
U,pA; Up = —A,;. (4.8) 


Applying the considerations of Section V.4 to the space A(n), where there is 
no parity doubling, one has 


Up|nim> = (—1)'n,|nIm), (4.9) 


where 7, = +1 or, = —1, but may be different for different spaces #(n). 
We will study the n-dependence of y below. 
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VI.5 The Hydrogen Spectrum 


In the previous section we described all the irreducible representation spaces 
A(n) of the algebra of angular momentum and the Lenz vector [i.e., all 
irreducible representations of &(SO(4)) that fulfill (3.18)]. The relation (3.17) 
connects these spaces with the energy eigenspaces #(E) of negative energy 
B= (); 


Cr — 1) "a (—28) * = I, (5.1) 
Thus the energy in each A(n) is given by 


a 


E=E,=-7>35 
: 2n?’ 
and the A&(n) are all possible energy eigenspaces of the hydrogen atom. 
The space of physical states of the hydrogen atom @ is the direct sum of 
all these A(n) 


R= >; ®@ An), (5.2) 


and the spectrum of the energy operator in Z& is 
Specttum H — 2 .,— —a-(2n-)  (n = 1, 2,3,...). (5.3) 


n is called the principal quantum number. 


[The meaning of infinite linear combinations of f, € A(n) (ie., 
1 f,) requires a precise definition. It may mean infinite sum in 
the Hilbert-space sense OF, fall? < co), in the Schwartz-space 
sense ()' || f,|?n?? < oo), or in some other sense. Even if (5.2) is 
meant in the Hilbert-space sense, & is “different” from the Hilbert 
space H = (V2.0 d,|Hosedy = (P? + Q?)6, =(v + Hd, and 

© olld,||? < co}, which is the usual Hilbert space of quantum 
mechanics. The isometric map between #@ and # is physically 
meaningless, as it would map elements that represent different 
physical states into each other. 

Physical states of the hydrogen atom are in any event only 
vectors in the dense subspace ), #(n) of the Hilbert space, where 
the sum extends over an arbitrarily large but finite number of n. 
Any limit n> oo is a mathematical idealization, and physically 
for n— oo the physical system ceases to be a hydrogen atom; it 
ionizes and becomes a system consisting of a proton and electron. 
One can, of course, consider the enlarged physical system, consisting 
of hydrogen-atom states as well as electron-and-proton states. For 
such a physical system & is not the total space of physical states, 
though this space may still be a dense subspace of .#. The space of 
physical states of this system is the direct sum of # and a space #” 
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that is the continuous direct sum of irreducible-representation 


spaces of the algebra &(SO(3, 1)) generated by L; and A; = ./2H 4. 

The space & is an irreducible-representation space of the algebra 
&(SO(4, 1)) [and also of &(SO(4, 2)) and of &(£,)]; the space # 
is an irreducible-representation space of &(SO(3, 2)). Such algebras 
have been called spectrum-generating algebras, the underlying 
groups are called spectrum generating groups. | 


Transitions between different energy levels are performed by the position 
operator Q; (dipole transitions) and powers thereof (multipole transitions). 
Thus the Q; are observables that transform between different spaces A(n). 
Using the property that the Q, (x = 0, +1) are the components of a proper 
vector operator, one obtains 


Q,|nim> = y, ln E— Im+x«)><im1x|l — 1m + «>i + 1)'/2q7" 


+ dln d+ Im+x><Im1ix|l+1m+4+ «(Ql 4+ 1)'/7gr", 


(5.4) 
where the reduced matrix elements 


qi"" = (21+ 1) "?<n'l — 11 Q\\nl, 
qi"” = (21 + 17/7 <n' 1 + 11 |Inly. 


depend upon n’, n. They can be computed (Problem VII.1), but at present 
we only need the property that they are different from zero for all n’ and n. 
The physical meaning of this statement is that there exist dipole transitions 
between all energy levels of the hydrogen atom. 

We can determine the n-dependence of the intrinsic parity n, using the 
U, transformation property of the Q,. Applying U, to (5.4) gives, according 
to (4.9), 


UpQ,|nlm> = (1 a —Im+x><Im1ik|l—1m+4+k)> 
“(21+ 1)!2q," 
+ Geen he! +1lm+K«><UIm1xll+1m+rK) 
x (21 Aye gs. G5) 
But use of (V.4.1) gives 


UpQ,|nim> = —Q,Up|nIm> = —(—1)'n,Q,|n1m) (5.6) 


Inserting (5.4) into the right-hand side of (5.6) and comparing the co- 
efficients with those of (5.5) (.e., making use of the linear independence of 
the basis vectors), one finds 


Neds = NeQie te dg) = pen (5.7) 
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As the reduced matrix elements are different from zero, q""” 4 0, and g"" # 0, 
it follows that 


Ube = Ui = Up (5.8) 
i.€., 7, is independent of n, and the parity of the energy eigenstate is 
Up|nIm> = (—1)'n|nIm), (5.9) 


where y is either + 1 or —1. Equation (5.9) means that the parity of a state 
with a definite angular-momentum value / is the same in all energy spaces 
Rn). By convention one chooses 7 = +1; only the relative parity has a 
physical meaning. 

Each energy eigenspace A(n) of the hydrogen atom is, according to (4.4), 
the direct sum of angular-momentum eigenspaces #’; thus each energy level 
E,, contains the n angular momenta 


P=0.b2s..4n = 1 
Each angular-momentum eigenspace #' in A&(n) is, according to (III.3.24), 
the direct sum of the 2/ + 1 one-dimensional spaces #1: 


api 
A= Y OZ, (5.10) 


&é(SO(2)) m=-l 


which are spanned by each vector |n 1m). &(SO(2)) denotes the subalgebra 
of &(SO(3)) that is generated by L,. The dimension of A&(n) is therefore 


Faey=n, (5.11) 
1=0 


i.e., there are n” linearly independent eigenstates of H with eigenvalue E,, or 
the eigenvalue E, is n?-fold degenerate. 

In order to compare the calculated energy spectrum with the experimental 
data measured in eV, we insert a = m,e?/h from (3.5) into (5.3): 


Se ree Re (5.12) 


(recall that E,, is the energy in eV - m,, E,, is the energy in eV). The number 
4 2 2\2 
m c*m, [e 
R’ = ica ee ee 
2h 2 \ch 


mc? = 0.51 x 10° eV 
e? = 23 x 10° 2% esu? = 23 x 10° 2° dyncm?’, 
cn I 


~ ch 137.036" 


can be calculated using 
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Figure 5.1 Diagram of the energy levels of hydrogen. 


R" = m,e*/(2h7) is the Rydberg constant in energy units. Its numerical value 
is 
ROS13.53 eV. (5.13) 


The energy levels of the hydrogen atom are plotted in Figure 5.1, where 
details in the experimental energy spectrum have not been taken into account. 
The numbers on the left give FE, — Ej,, which is measured in eV. As one sees, 
(5.12) is experimentally very well fulfilled. The energy levels are spaced closer 
and closer to one another as n increases, and they finally converge to E,, = 0 
for n— oo. E > 0 was excluded by our restriction to the subspace where 
— 2H is positive definite; this is the region in which the quantum-mechanical 
Kepler problem does not describe the hydrogen atom, but decribes an 
electron that is scattered by a proton (scattering states). Thus if we choose as 
our physical system the hydrogen atom only, & of (5.2) is the space of physical 
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states. If we consider as our physical system the system that consists of the 
hydrogen atom, electron, and proton, then the space of physical states is 
larger than &, and @& is the subspace of discrete energy eigenstates.° E = 
E.. = 0 is the energy value at which this physical system ceases to be a 
hydrogen atom; beyond this point the electron energy can take any value and 
one can also prepare approximate momentum eigenstates of the electron. This 
means that if one starts with hydrogen atoms in the ground state, one can 
prepare the physical system in an approximate momentum eigenstate of the 
electron ifone transfers to it an energy of at least — E,,and that the system is no 
longer a hydrogen atom. The continuous energy states correspond to the 
hyperbolic orbits of the classical Kepler problem. 

The numbers on the right-hand side of the diagram give the (negative) 
energy in units of cm“ ', ie., the wave numbers v, that are connected with 
E|: 


E’ 

=~. 14 
"Inthe oe 

Inserting (5.12) gives for the wave numbers 

e*m, 1 
= aaa 15 
h°4nc n? C > 
The quantities 
1 e*m 

= —_ R= — 3.29. 10? sec 5.16 
SS Tere ae 
3) pri Me _ 1.997 x 108 m=! (5.16’) 

2nhc 4nhec 


are the Rydberg constant in frequency and wave-number units respectively. 
(Originally it was R that was called the Rydberg constant.) 

The lines connecting the different energy levels depict the transitions in 
the emission or absorption of electromagnetic interaction. According to 
(III.1.1), the frequency and wave number of the absorbed or emitted light 
quantum are 


(in sec™ '), (5.17) 


(in cm~*). 647} 


5 It should be remarked that we are discussing the electron states only, i.c., describing the 
electron under the external influence of the nucleus. If we wanted to describe the combined 
system of electron and nucleus (proton), we would have to take as the space of physical States 
the direct-product space of the electron space ‘@ and the proton space # p: # @ # p(Axiom IV), 
or # @ #%cem,, where Hcy, is the space for the center-of-mass motion. Neglecting the proton 
structure (e.g., proton spin), #p is one-dimensional for protons at rest. 
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The number on each transition line in Figure 5.1 gives the wavelength of 
the light in angstroms: 


A= 1/Vam = €/Vam > 


with tA = 105° cm — 10) * p. 

All transitions terminating in the same lower state form a spectral series 
and each series of the hydrogen atom has a historical name. The series of 
lines corresponding to transitions to the level n = 1 is called the Lyman 
series. The frequencies of the Lyman series are 


1 1 
Vim = RR’ fi = =) (n= 2354..2.): (5.18) 


The line with the longest wavelength is c/v',, = 1215.68 A This line lies in 
the ultraviolet region (cf. Figure I[I.1.2). The series with n = 2, 


Ta 
Yon = B (52 - 2) Ga sae) (5.19) 


is called the Balmer series. It lies in the visible region of the spectrum, and 
(5.19) was first obtained empirically by Balmer in 1885. It played an excep- 
tionally important role in the conjecture of the quantum theory of atoms. 
The other series in Figure 5.1 with n = 3 (Paschen), n = 4 (Brackett), and 
n = 5 (Pfund) lie in the infrared region. 

To each energy level with energy value E,, corresponds the energy eigen- 
space A&(n) of dimension n?. The energy value Eis n*-fold degenerate. Thus 
if the energy of the hydrogen atom is measured as E,, then the state is in 
general a mixture (except for n = 1) and the (normalized) statistical operator 
of this state is 


W = (Tr A,)~!A,, (5.20) 


where A,, is the projection operator on the n?-dimensional subspace A(n). 
Energy measurements on the hydrogen atom do not reveal the value of the 
angular momentum; they just give an upper limit to the possible angular 
momentum values / = 0, 1,...,n — l. 

Then n’-fold degeneracy of each energy level is a peculiarity of the hydrogen 
atom, or the 1/r potential. This is connected with the existence of the constant 
of motion A; in addition to the angular momenta L;. In general one has only 
a (2/ + 1)-fold degeneracy of the energy values as a consequence of [L;, H] 


However, the degeneracy for the hydrogen atom is only approximate, 
and Figure 5.1 is only a rough picture of the experimental situation. The 
above theoretical description, like every theoretical model, gives a description 
of the situation in nature only up to a certain accuracy. Finer details of the 
hydrogen spectrum are shown in Figure 5.2 for the low-lying energy values. 
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Figure 5.2 Low-lying energy levels of atomic hydrogen. (Not to scale.) 


The first number by each energy level gives the value of n. The symbol S, P, D 
stands for the angular-momentum value / = 0, 1, 2 respectively (e.g., 2P 
means n= 2, |= 1, and m can be anything such that —l1<m< +l). 
According to (5.12) all 2S and 2P states should have the same energy, as 
shown in Figure 5.1. This is only approximately the case. The 2P states 
split into two subsets 2P?/? and 2P'/? with 


E(2P?!7) — EQQP'!*) = 0.453 x 10~+eV = 1095 x 107 sec!. (5.21) 


Comparing this with the numbers of (5.12), (5.13), we see that our model 
indeed gives an excellent approximation to the hydrogen spectrum; the 
deviation (5.21) is only a fraction of one percent. The splitting (5.21) (fine 
structure) and the 3P*? to 3P!/? and 3D°’? to 3D” splittings can be explained 
if a new observable, the electron spin, is introduced. The superscripts *””, '’, 
etc. are connected with this new quantum number, the spin. 

The splitting between the levels 2S'/? and 2P'/? cannot be explained 
even after the spin has been introduced. It is again an order of magnitude 
smaller than (5.21): 


EQS'!7) — E(2P**) = 0.0353 cm! 
= 0.437 x 10-5 eV 
= 1058 x 10° sec”! 


and is called the Lamb shift. The Lamb shift is a consequence of the inter- 
action of the electron with the fluctuations of the radiation field, and its 
explanation requires a new theory called quantum electrodynamics. 
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The splitting of each term 1S!/?, 2S1/?, 2P1/, etc. into two levels, the 
hyperfine splitting, is another order of magnitude smaller than the Lamb 
shift: 


E(2S!/? upper) — E(2S'/? lower) = 0.0059 cm™', 
E(2P'!? upper) — E(2P1/? lower) = 0.0020 cm~?. 


In order to explain this splitting one needs to include observables that 
describe the internal structure of the nucleus (in particular, nucleon spin). 

Summarizing, we have seen that our simple theoretical model describes 
the features of the hydrogen spectrum very well. However, to explain the 
fine details requires the introduction of new observables and even completely 
new theories (quantum electrodynamics and nuclear physics). Any given 
theory has only a limited domain of applicability, and this example of the 
hydrogen spectrum serves very well to illustrate that behind each new digit 
in an experimental number may be hidden a completely new domain of 
physics. 


Problem 


1. Fill in the missing steps in the treatment of the hydrogen atom: Let L; be the 
components (3.1) of the orbital-angular-momentum operator, and let H [as given by 
(VI1.3.5)] be the energy operator of the hydrogen atom. Let A, denote the Lenz vector 
given by (3.6). 

(a) Show that as a consequence of the canonical commutation relations the angular 
momentum and the Lenz vector are constants of the motion, i.e., satisfy (3.8): 
[L; H]=0 and [A,, H] = 0. 


(b) Define A; by (3.9). Show that A; is Hermitian and a constant of the motion: 
A} = A; 
and 
[A,;, H] = 0. (3.10) 
(c) Show that L; and A; fulfill the commutation relations of (3.11) and (3.12): 
[A;, LJ = 1; ;, A, 
[A,, A;] — 16; 4, Ly. 
(d) Define the Casimir operators C, and C, by Eqs. (3.13) and (3.14). Derive (3.17) 
and (3.18): 
C, = a*(—2H)™' — I, 
C2 = 0. 
The following fact is useful in carrying out the above derivations: If two operators A 


and B commute, [A, B] = 0, then [f(A), B] = 0 for any well-defined function of the 
operator A. 


CHAPTER VII 


Alkali Atoms and the 
Schrédinger Equation of 
One-Electron Atoms 


In Section VII.1 the concept of perturbation theory is explained using the 
example of the alkali atoms. Section VII.2 represents an algebraic calculation 
of the matrix elements of Q-” (v = 1, 3, 4,...); the results are used for the 
calculation of the energy values of the alkali atoms. Section VII.3 gives a 
brief description of the solution of the Schrédinger equation for the hydrogen 
atom, which is used for an alternative computation of the matrix elements of 
Q~’ and an evaluation of the alkali energy values. It also lists some properties 
of the spherical harmonics which are used in the second part of this book. 


VII.1 The Alkali Hamiltonian and Perturbation Theory 


The alkali spectra are very similar to the hydrogen spectrum. This ts suggested 
by their classical model, according to which the alkali atom consists of one 
electron—the “valence electron” —that moves in the Coulomb field of the 
nucleus and in the average field of the other electrons that are in the orbits 
closer to the nucleus. 
The classical potential V,(r) for this “outer” electron has, for large values 
of r, the form 
Va = e-/e 
because the Z — 1 electrons in the inner orbits screen the charge of the nucleus 
Ze, so that the effective charge will be e. For very small r (smaller than the 
radius of the inner orbits) 
V,(r) = —Ze?/r. 
189 
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Therefore V,(r) can be written 
2} 
Vir) = —— + Vr), (1.1) 


where V(r) is negative and different from zero only in the neighborhood of 
the nucleus. The classical Hamiltonian function of the outer electron is then 


h = hyyar + V(r), @i2) 


where V(r) is a small perturbation of the Hamiltonian hyya,. 

The exact expression for V(r) is not known since it depends upon the un- 
known charge distribution of the core, which consists of the nucleus and 
the inner electrons. If V,(r) is spherically symmetric, then V,(r) may be 
expanded in the form 

e? e? e , 

VAD ae Cope a eg (1.2’) 

where coe is the charge, c,e = ed-x/r is the dipole contribution, c,e = 

eq;jX;X,/r7 is the quadrupole contribution, etc., of the core. For the alkali 

atoms the extension of the core is small compared to the distance of the outer 

electron, so that c, is small and contributions by the quadrupole moment 
q;; and by higher multipole moments are negligible. 

The Hamiltonian operator for the quantum-mechanical system is therefore, 
in correspondence to (1.2), 


H=K+Y, (1.3) 


where K is the Hamiltonian operator of the hydrogen atom, 


(1.3’) 
and 
V ay (Gy ee (13°) 


is the “small perturbation” of K. In (1.3), (1.3’), and (1.3”) we have used the 
units adopted in Section VI.3, 1.e., we have made the replacements Q,/h > Q; 
and m,H —> H. 

The problem is now to find the spectrum of the Hamiltonian operator 
H and its expectation values in the physical states. This can no longer be 
done exactly. Also, we do not know the exact form of V(r) or V(Q), though 
we know some of its properties. We can only hope, therefore, that the 
spectrum of H will not differ significantly from the spectrum of K, i.e., that V 
causes only a small perturbation of the spectrum of K. 

The determination of the alkali spectrum is a problem of perturbation 
theory, which will be discussed in more general terms in the next chapter. 
One major factor in this kind of problem is the choice of an appropriate basis. 
One starts with eigenstates of K, i.e., chooses a c.s.c.o. that contains K. If 
the other members of the c.s.c.o. do not commute with V, then V applied to 
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this eigenvector will not only change the eigenvalue of K but also the 
eigenvalues of these other observables. Thus V will perturb not only the 
spectrum of K but also the spectrum of the other members of the c.s.c.o. 
Therefore the c.s.c.o. should be chosen so that as many of their members as 
possible commute with V. 

For the alkali atoms we have 


LV, L,] = 0, (1.4) 
because 

iO =: 
Consequently, we have the ideal situation where H as well as K can be 


diagonalized together with L? and L;. Thus the basis |n | m) of the hydrogen- 
atom problem is a very appropriate basis to start with. 


We again define 
A, = (—2K)"*?(Geaa{P1, Ly} + 0:0~*), G3) 

and we again have 
C,|nIm> = (A? + L?)|n1m) = (n? — 1)|[nIm). (1.6) 


Unlike the case of the hydrogen atom, however, it will not be true that the 
full Hamiltonian operator H commutes with A,, A’, and C,: 

LH, Ai] = LV, A,] # 0, 

oS Gr UAT er at: 
Thus H and C, (and consequently H and K) cannot be diagonalized together, 
i.e., there are no vectors in # that are eigenstates of H and also eigenstates 


of C,. Let us denote the eigenstates of H by |A/m) and the eigenvalue by 
E): 


(1.7) 


H|Almy = E,|Al my. (1.8) 


H can be diagonalized together with L”, L, because of (1.4).] 

Now the question arises: Which are the physical states, 1.e., which are the 
states that the alkali atoms can be prepared in? According to the basic as- 
sumption III, the alkali atoms should be in an energy eigenstate (or a mixture 
of energy eigenstates) if an energy measurement has been performed on the 
system, i.e., the statistical operator should be 


Wo peepee, OF Winer), (1.9) 


if only the energy, or only the energy and angular momentum, or the energy, 
angular momentum, and z-component of the angular momentum have been 
measured, respectively. Here P, is the projector on the space of eigenvectors 
of H with eigenvalue E,, P,, projects on the space spanned by |A/m)> 
(m= —I, -141,..., +), and P,1, projects on the space spanned by 
|Alm). 

If the state of the alkali atom has not been prepared by an energy measure- 
ment but by a measurement of the observable C, or K, then, according to 
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the basic assumption III, this physical system should be in an eigenstate of 
C,, i.., the statistical operator should be 


We WO=P,. of Wi Po. (1.10) 


where P,, is the projection operator on the space of eigenvectors of C, with 
eigenvalue n? — 1. 

Although at this stage we cannot exclude the possibility that states of 
atoms are prepared by a C, measurement, this possibility appears very 
unlikely, as C, and K have only an auxiliary meaning (except for the hydrogen 
atom) and the physical observable is H. (Also, in Chapter XII we shall see 
that states that do not change in time—and the states that correspond to 
the energy levels of the atoms are likely to be well described by this property— 
have to be eigenstates of the energy operator.) Thus, the occurence of physical 
states given by (1.10) would make the formulation of the theory very un- 
satisfactory and would cause doubts about the appropriateness of III. 
Therefore it will be interesting to see how the predictions of (1.9) and (1.10) 
differ. In this chapter we shall perform the calculation for the alkali atoms 
and see that an answer to the above question cannot be obtained from the 
alkali spectra. In the later chapter on two-electron atoms (helium atom) we 
shall, however, see that experimental data require (1.9), which reassures us 
of the correctness of our basic assumption II[—and the appropriateness of 
the meaning of stationary states (cf. Chapter XII). 

The values predicted for the measurement of the energy operator for the 
state WY” of (1.10) are, according to the basic assumption II, 


CH) wom = Tr(WR A) = Tr(Paim 1) = Tr(PrimK) + TrPaim V(Q)) 
<nlm|K|nIm> + <nlm|V(Q)|nIm)> 


2 
a 
== Eee + <nlm|V(Q)|nilm)> 


74 
a 
Sr e(n, 1) = E(n, I). (1.11) 


For the case where the state is WY" of (1.9), we have 
Cy = Tr(WQ"H) = Tr(Patm H) = <Alm|H|Alm> = Ey. (1.12) 


The term ¢(n,/) in (1.11) is the matrix element of the small perturbing 
Hamiltonian V: 


e(n, 1) = <nIm|V(Q)|nI my). (1.13) 


As |n1 m) is known, this matrix element can be computed if V(Q) is known. 
The value e(n, |) does not depend upon m, because of (1.4). Therefore, for the 
expectation value in the state P,,, one obtains 


TWH) 1 


— nim|H|nIlm)> = E(n, J. (1.14 
Tr W T+1,%$ ua ? Oe) 


(Awe = 
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In contrast to the expectation value E(n, /), E,, is an element of the spectrum 
of H. The spectrum of H and the states |A 1m) need to be determined. The 
spectrum of K and the states |n 1m) are known from the hydrogen-atom 
problem, and in order to calculate E(n, !) it only remains to calculate the 
“small” matrix elements e(n, /). 

The eigenvalues of H and the eigenvectors are determined by a perturba- 
tion-theoretical calculation. It will turn out that E(n, J) of (1.11) will be the 
first-order approximation E{)) for the perturbation-theoretical calculations 
of the eigenvalue E,,. This is the reason we cannot distinguish between 
(1.9) and (1.10). As E(n, 1) cannot be calculated exactly [because V(Q) is 
not known precisely], we can never tell from comparison with the experi- 
mental energy spectrum of the alkali atoms whether E(n, 1) is the exact 
energy value or only a first approximation. 


VII.2 Calculation of the Matrix Elements of the Operator Q 7” 


To obtain the correction term ¢(n, 1) = <nlm|V(Q)|nIm), we calculate 
the matrix elements of the operators Q~” for any integer v. These matrix 
elements will also be used for the evaluation of the fine structure splitting in 
Section IX.4 below. 

We proceed in the following way. We first derive some relations between 
the operators Q~’, L?, and 


P2 
K= co GN) 


O18 


The matrix element of one of these relations will give a recursion relation 
for the matrix elements <n! m|Q~*|nIm). 
We introduce the operator 


1 \Q; QO; i 
P.=-<x =, P; == P,-——. (2:2) 
loro" g 
As a consequence of the Heisenberg commutation relation it follows that 
Cy Pal (2.3) 


| Equation (2.3) follows from 


fostorrd] atom} atr-o-el 


by use of 
LQ, Pi] = iQ;/Q. (2.4) 
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The proof of (2.4) proceeds as follows: 
[Q7, P:] = (Q,0;, P:] = 0,[Q;, P:] + [Q;, P:JQ; = 2iQ;. 
On the other hand 
[Q’, P:] = (QQ, P:] = QL9, P:] + [Q, P:JO = {9,[9, Pil}. 
and consequently 
{Q, LQ, Pil} = 2iQ;. (2.5) 
Furthermore, : 
[Q, [Q, PI] = (QQLQ, P;] + QLQ, P;1Q) 
=(OlG, P10 + [OP AGs) 

= [@, O19, Pill + [@71O, Fe) 

= [Q, {Q, he. P;}}] = [Q, 2i0;] SS 0, 
where (2.5) has been used. However, if Q? commutes with [Q, P;], 
then Q = (Q7)!/? commutes with [Q, P;] also (by the definition of 
the square root of an operator). Therefore it follows from (2.5) that 


20(Q, P;] = 21Q;, 
which in turn gives (2.4). | 


As a consequence of the definition of L, L; = €;,,Q;P,, and the definition 
(2.2), it follows by a straightforward calculation that 


1 
| ieee Raat o? i (2.6) 
With this the energy operator K can be written 
Poel od 
. K = le —. —- —, Dal, 
as OO (2.7) 


P?/2 is the operator of the kinetic energy of radial motion, and Py the 


conjugate to the radius operator Q = OO. is often called the radial 
momentum operator. 


The radial momentum operator fulfills the commutation relation 
LO eee (2.8) 
[ We prove (2.8) by induction on vy. First note that 
e210 OOF FON oO”. 

Then use of (2.3) gives [Q~', P,] = —iQ~*. Consequently (2.8) 
is true for v = 0, +1. Assuming (2.8) is true for v = n, calculate that 
[O*"*, P,) ORO FeO Caen 

= Q-\(-ing-** a ion = = —i(n = HO. "2! 
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Thus if (2.8) is true for v = n, it is true for v = n + 1. In the same 
way, one shows (2.8) is true for v = —(n + 1) given that it is true 
for v = —n. Therefore (2,8) is generally true.] 


As a consequence of (2.8) and (2.7) it follows by a straightforward calculation 
that 


PO ae — 1)00 Pe vO =) (2.9) 
A special case of (2.9) is 
iO] = iP, (2.9a) 
The relation 
[Kip 7-040 (2.10) 


can be checked by inserting (2.7) into the right-hand side and using (2.8). 
With (2.9) and (2.10) one calculates 


[K, [K, Oe] = (v A D(—v ae HOT” iP. = vO Ft) pe 
+ 4v(v + 1)(v + 20-9) ne L797 ¢+3) . ag-°*?), 
(2.11) 


Inserting (2.9a) and (2.9) and taking the commutator of the result with K 
gives 


12 “LQ, K], K] = lO = K] ot VEOE ee, K] 
= [EK 9-*-), K} + fO-”, KL, 212) 
y — 
where one term has been added to both sides. 
Inserting (2.11), (2.7), and (2.9) into the right-hand side of (2.12) gives 


[O-[0, K}, K] + 4vf0-°*, K] = WQ-* PK — (+ HOOT ML? 
+ (2y + 1)a0-°* + bo + IW + DQ°-°T (2.13) 
The recursion relation for the matrix elements of Q ’ is obtained if one takes 


the matrix elements of (2.13) between the vectors |n 1m), for which 


a = 
— Fy lem), 


Pelt =i + 1) | nim), 


K|nIm)> = 


L,\nim>y = minim). 


The left-hand side of (2.13) between the vectors |n 1 m> gives zero, so that we 
get as a recursion relation 


2 
v= ah <nlm|Q-°*|nim> + Qv + Da<nim|Q-°*”|nim)y 
n 


+ [—-( + II + 1) 4+ $00 + DO + 2) <nim|Q-°*?|nlm> (2.14) 
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If one takes the matrix elements of (2.13) between states |n!m)> and 
|n’ I’ my, one obtains recursion relations for the off-diagonal matrix elements." 
For v = —1 Equation (2.14) gives 


<nim|Q-!|nlm> = a/n?. 2.15) 


The matrix elements of Q~? cannot be obtained from (2.14); we do not give 
its derivation here, but just the result: 


A 


- a 
For v = —2 Equation (2.14) gives 
hn Ki + 1) 
<nImiQinimy = 5 (3 - = ). 


Of all the matrix elements of Q~” that can be obtained from (2.14), we list 
the lowest ones: 


<nIlm|Q~3\nIm)y = Way rlmla-7intm> 


a 


~ wil + 1 + 4)’ 


a 3n* — 1 + 1) 
n? 2+ 1)—3 


(2.17) 


<nim|Q~4\nIm)y <nim|Q~3|nlmy. (2.18) 


The constant a has, in our units, the dimension of an inverse length. 
According to (VI.3.5), 


3 a= m,ca. (2.19) 
The above Q is, because of our choice of units, not measured in cm but rather 
in (eV/c)~' [cf. the discussion preceding Eq. (VI.3.4)], i.e., the above Q is in 
fact Q/h. If Q is measured in its usual units of cm, then the above a is to be 
replaced by a/h. Thus for Q measured in cm the above equations hold with 
a replaced by 


¢ eettcees sl 1 220 
a= =—= : : 
h rp. 0.529 x 10° cm o>) 
rz is called the Bohr radius. 
We can now give the energy value 
24 
EG. ee = + <nlm|V(Q)|nImy (2.21) 


'L. C. Biedenharn, N. V. Swamy, Journal Math. Phys. 11, 1165 (1970). 
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for any V that is a sum of positive and negative powers of Q. Equation 
(2.21) is—as we mentioned above and as we shall see in the next chapter — 
also equal to the first approximation of the perturbation expansion. We shall 
use for V(Q) only the dipole term 


V(Q) = —c,a/Q’, (2.22) 


which—as was mentioned above—should be a sufficiently good approxima- 
tion. 
From (2.16) and (2.22) one obtains for E(n, I) 


a’ Zea 
En, ) = ——(1+—*). 
(n) 2n? ( i n(l + 5) Cc») 
Thus the representation spaces &@(n) of &(SO(4)) no longer have only one 
energy value, for H does not commute with A; [In group-theoretical language 
this is expressed as “V(Q) breaks the SO(4) symmetry.” ] 
It is customary to write (2.23) in the form 


OX a 


BH = — onet = — n— o” 


(2.24) 
where n* is called the effective quantum number and o = o(n, I) is called the 
quantum defect. Comparing (2.24) with (2.23) and using the fact that c, is 
small compared to the Bohr radius 1/a, one obtains that the quantum defect 
for the alkali atoms is given by 


c,a 


(2.25) 


The energy levels for lithium, sodium, and cesium are given in the diagrams 
of Figures 2.1, 2.2, and 2.3, respectively. The energy levels are labeled in the 
standard notation. The first number stands for the principal quantum number 
n, and the letters s, p, d, f, ... denote the angular-momentum quantum- 
Humber valuess)— 0, 1,23,..., respectively; egy 3p means n= 3, 1 — I. 
The lines connecting different energy levels indicate dipole transitions. The 
numbers in the lines give the wavelengths in angstroms. These transitions 
fulfill the selection rule Al = +1 [cf. (V4.33)]. Thick lines indicate transitions 
of higher intensity. 

Comparing these energy diagrams with the energy diagram of Figure 
VI.5.1 for hydrogen, we observe the /-dependence of the correction term in 
(2.23), which leads to a splitting of the levels with the same principal quantum 
number. Thus (2.23) gives a good quantitative approximation to the experi- 
mental energy spectrum. We observe further that some of the energy levels 
split in contradiction to (2.23); e.g., there are two energy levels 3p, and 3p, 
of sodium, whereas according to (2.23) there should be only one. According 
to (2.14) through (2.18), even if one takes for V(Q) the general spherically 
symmetric expansion (1.3”), the energy values should depend upon n and | 
only. The splitting of these levels with the same value of the quantum 
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Figure 2.1 Energy levels of lithium atoms. 


numbers n, | will be explained by the introduction of the new observable 
spin, which will be discussed in Chapter IX. 

We also observe in the experimental energy diagrams of Figures 2.1—2.3 
that for lithium there are no energy levels with n = 1 or 2, that for potassium 
(which is not depicted here) there are no energy levels with n = 1, 2, or 3, 
and that for cesium there are no energy levels with n = 1, 2, 3, 4, or 5. This 
means that our physical system, which is the outer electron in the electro- 
static potential V(r) = —e?/r + V(r), cannot be in a state with principal 
quantum number n = | (lithium), n = t or 2 (sodium), etc. In Chapter X we 
shall see that this can be explained as a consequence of a new basic assump- 
tion of quantum mechanics, the Pauli exclusion principle. 
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Figure 2.2 Energy levels of sodium atoms. 


VII.3 Wavefunctions and Schrédinger Equation of the Hydrogen 
Atom and the Alkali Atoms 


The Solution of the Schrédinger equation for the hydrogen atom 1s so 
familiar that it need not be discussed in detail here. It does not lead to any 
new observable results, which have not already been discussed in the previous 
chapter. The new quantities that it provides are the wave functions W,,,,(x) = 
{x1,X2,X3|1nm> = <x|n 1m). The quantity |W(x)|* represents, according 
to the results in Chapters I and II, the probability density for the observation 
of the electron in the state |n / m) at the position x. Such a position measure- 
ment, however, cannot be performed. Nevertheless, solving the Schrédinger 
equation is the most common method of calculating the energy values, 
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Figure 2.3 Energy levels of cesium atoms. 


and it is often a very convenient method. Furthermore, the wave functions 
will be of physical importance for scattering problems. We give here only a 
brief description of this method, since detailed calculations are given in 
many good standard textbooks on quantum mechanics. 

The Schrodinger equation for the one-electron atom is obtained by taking 
the “scalar product” of the energy eigenvalue equation 


H|nIlm> = E|nlm) 


with the generalized eigenvector|x> = |x,x, x3). Using(2.7) one obtains then: 


<x| & ze 9 + Zo) |InIm> = E<x|nIm). (3.1) 


For the hydrogen atom V,(Q) = Viaya(Q) = —a/Q. Equation (3.1) also 
applies to any other one-particle problem in three-dimensional space if 
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V,(Q) is replaced by the appropriate potential-energy operator, and the 
|n 1m) by the appropriate basis vectors. 

In the basis of generalized eigenvectors |x) of the Q ;, the position operators 
Q; act like multiplication by x;: 


<xlQilW> = xi<x|p>. (3.2a) 


The momentum operators P;, according to (I1.8.59), act like differentiation 
with respect to x;:? 


ix|Pilp> = Fa ol. (3.2b) 


We label the generalized position eigenvectors |x» by the polar coordinates 
(r, 6, p) instead of the Cartesian coordinates: 


x, =rsin@cos ¢, x, =rsin @sin ¢, X3 = rcos 80. (23) 
From (2.2) and (3.2) it then follows that 


cx Psd) = 5 (5 +) <ul> 


(3.4) 


<KIPEIV) = — 25 (5 <xlv9). 


The matrix element of the angular momentum operator L; = €;;,Q;P, in 
the basis of generalized eigenvectors of the Q; is, according to (3.2), given by 


(x[LilW> = 5 eins «xh. (3.5) 


If the generalized position eigenvectors are labeled by the polar coordinates 
(3.3), the differential operators on the right-hand side of (3.5) have to be 


2 Note that the derivation of (11.8.59) required, in addition to the Heisenberg commutation 
relation, the assumption that P? + Q? had at least one eigenvector (or equivalently P? + Q? 
was essentially self-adjoint). Thus (3.2b) is not a consequence of (VI.3.4) only, but requires an 
additional assumption. This assumption is natural for the case of the three-dimensional oscillator, 
because then P? + Q? represents the observable energy. For the hydrogen atom the energy is 
given by (2.1), and P? + Q? need not correspond to any observable for the hydrogen atom. Thus 
(3.2b) is an additional assumption, and the description of the hydrogen atom given in Chapter VI 
is not completely equivalent to the description by the Schrédinger equation. # given by (VI.5.2), 
considered as a Hilbertian direct sum, is a complete Hilbert space; it is a representation space 
of a unitary representation of SO(4) and of an irreducible unitary representation of the groups 
SO(4, 1) and SO(4, 2). In # the operator C, given by (VI.3.13) and therewith, by (VI.3.17), 
the energy operator of the hydrogen atom is self-adjoint. The space of square-integrable 
solutions of the Schrédinger equation (3.1) with discrete eigenvalues (the space of bound states) 
is not a complete Hilbert space. Remarkably, these distinct topological properties do not make 
any difference for the observable quantities. 
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transformed from Cartesian to spherical coordinates; the results of that 
somewhat lengthy transformation are 


cxlLaIv> =; (sino — cot 0cos rool (3.6;) 


op 


<xlLalv> =; (+008 4.55 ~ cot Osin d=, )<roiy> (3.6,) 


op 


KxIEsIV> = 5 35 (PO BIW). (3.6) 
From this one calculates 

a js 3 
(5 0d¢2 sin 000 
For |w> = [nl m) the equations (3.63) and (3.6) become 


<x|L? p> = — (sin 5 wa) ) ro olW> (3.6) 


7 ap SPO bln my = mr 0.d|nIm> (3.63’) 


and 
Lao? Dea 
sin? 00g? sin @ 00 


(sins 55) )<rom Im> = Kl + 1) <r0¢|nIm). 
(3.6’) 
Inserting (3.4) into Cy gives 


1106 
Or ar 


aS a 1) 


(P53 ap Ola! m>) 40 <rO6d|nim> + Vir)<ré6dlnim) 


= E<r@o|nIlm>. (3.7) 


Using the expansion 


|nIm> = [ dxtxy cain Im) 


= [1 sin @ dr d0.do)[r 0 6) ¢r-0.d\nIm> (3.8) 


in the orthonormality condition 
Cn lm |nim> = dap On Omi (3.9) 


gives the normalization condition for the transition coefficients: 


fr sin 0 dr dO db<n'I'm'|r 0 h> <r 0. b|nlM> = Sam Si Sm (3.10) 
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Equation (3.7) for a given value of | is a differential equation in the variable 
r only, while (3.65) and (3.6’) are differential equations in the angular vari- 
ables 6, @ only. Therefore the transition coefficients are written 


<r b|nilm> = RulT)¥im(O, ?). (3.11) 
Equations (3.7), (3.6’), and (3.63) then lead to 


fea. 0 Ki+1 
Fe (" a) + HC] Rad = E(n, DRA"), (3.12) 


1 ae IC he a0 
sin 2) ad? 7s sin 0 60 sino 5) Yim(9, ?) = Kl te LY ls ¢), (3:13) 


and 


1 0 
i a Yim(9, P) = MYin (8, ¢). (3.14) 
For the hydrogen atom V,(r) = —a/r and E(n, 1) = —a?/(2n’). The normal- 
ization condition (3.10) gives the following normalization conditions for 
he, ane Y,.: 


i ar RaAT)R. Ar) = Onn, (3.15) 
and 
| sin 6 d0 dd Yym(O, $)Yn(8s b) = Say Spm (3.16) 


The normalized solutions of (3.13) and (3.14) are the spherical harmonics 
Sy Eo OO alae oo 
a oy eee | aes) = CID 


It should be noted that different authors use different phase conventions in 
defining the Y,,,(0, @). When m = 0, (3.17) becomes 


21 +1 
Yio(9, &) = Pe P,(cos 9), (3.18) 


where P, is the Legendre polynomial. 
It is frequently convenient to relate (3.17) to the associated Legendre 
function 


_yyntl 1—|m| 
= Tapia tol sine’ @ 


mi)! dcos 0 


|m| ae 
Get Ly i Sm)! 


For m > 0 the desired relation is seen to be 


2i+ 1(l—m)! 


4n (1 +m)! a. Sle) 


Yim(9, @) = (— 1)” 
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for negative m one uses 
Y,, -m(9, 6) = (—1)"Yin(8, $). (3.19') 


Occasionally we shall require the explicit form of the spherical harmonics. 


In tabulating the formulas it is convenient to write Y,,,(0, 6) = Oi gine Dar. 
Working from (3.17), one then finds 


©,,(0) = —/3sin0, © ,0(8) = ./3 cos 8, 
©,,(0) = /2sin? 0, 2, = —./ticos O sin 8, (3.20) 
©39(0) = ,/$(2 cos? 6 — sin? 6). 


Finally we quote three important relations involving the spherical 
harmonics. The first of these states that {Y,,,} is a complete set of single- 
valued functions on the unit sphere: 


Z Tee ee (3.21) 


m=-1 sin 8 


The other relationship is the addition theorem: Let n, and n, be unit vectors 
with orientations specified by (0,, 6) and (6,, @2), respectively; then 


4 : v 
PAM) = 37 DL Yn b)¥n(O2, $2). (3.22) 


The last relation is the coupling rule:* 


(21, + DQ + YP? 
4n(21 + 1) 


Tite: P)Y im (9, 9) = » | 


x <I,m,l,m,|lm) <1,01,0| 10) ¥,,,(0, 6). (3.22b) 


The normalized solutions of Equation (3.12) for V,(r) = —a/rand E(n, |) = 
—a?/(2n*) are 


n—I—1 sig 
RG) = Gna) nin + YS Qnar\ Los (nane (3.23) 


where L2'4'(p) are the Laguerre polynomials. a = 1/rgo,, iS the inverse 


Bohr radius in units eV/c [cf. (2.19)]; in the usual units in which r and rgoh, 
have the dimension cm, a is to be replaced by a’ of (2.20). 

Figure 3.1b shows R,,(r) for some of the lower values of n and /. Figure 
3.la gives the probability distribution for finding the electron at r. 


3 See, e.g., Rose (1957, Section 14). 
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Radial probability densities r?R?(r) 


0 5 10 15 20 25 30 r/rs 
d states (I = 2) and 4f state 


(a) 


Figure 3.1 (a) Charge distributions of the lowest states of the hydrogen 
atom. The radial distance r is in units of the Bohr radius. The curves are 
labeled by the numbers nl. 


Knowledge of <x|n/m) allows an alternative computation of the matrix 
element of V(Q) to the one given in the previous section; ¢(n, !) = 
<nilm|V(Q)|n1m) can now be calculated by inserting the complete general- 
ized basis |x): 


e(n, 1) = [as [ae <nlm|x> <x|V(Q)|x’> <x’|n lm) 


= fax |<nlm|x)>|?V(r). 


Multiplying (3.23) by Y,,,(0, @), we obtain, according to (3.11), the transition 
coefficient <r@¢|nim)>. Substitution of <r0@|nim)> into the above 
equation and use of (3.16) to perform the 0, @ integrations then yield 


e(n, 1) = (2na)> es 1)! par) (Qnaty i Onar)|\e. o 
(2n(n + I!) 
(3.24) 
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Figure 3.1 (b) The radial wave functions R,,(r) for hydrogenic atoms 


for n = 1, 2, 3. Each curve is labeled by nl. Note the effect of the 
centrifugal force in “pushing out” the wave function from the nucleus. 
Note also that the functions have n — | — 1 nodes. [Part (b) from 
Dicke & Wittke (1960), with permission. | 


These integrals can be evaluated for the various functions V(r) = —c/r?, 
V(r) = —c/r’, V(r) = =c/r*,... and lead to the same result as given in 
Equation (2.14)-(2.18).. From Figure 3.la we can make some qualitative 
observations regarding the values ¢(n, !). For a given value of n the probability 
distribution moves towards larger values of r as / increases. Therefore, for a 
given value of n, —e(n, !) is the largest, so that E(n, | = 0) will be the lowest 
energy value, and with increasing / the E(n,!) will approach the value 


—a’/(2n*). That this is indeed the case can be seen from the experimental 
energy diagrams of the alkali atoms in the previous section. 


Problem 
1. Calculate the matrix elements of the position operator Q, (x = —1,0, +1) 


between the SO(4) states |n 1m) of the hydrogen atom, i.e., calculate 


<n' lm |O,|n im). 


It is sufficient to do this for the third component Q,. (Hint: Use the matrix elements of 
the modified Lenz vector A, and algebraic relations between the Q, and the A,.) 


CiIAPTER VItI 


Perturbation Theory 


In this chapter perturbation theory is developed in a general form that is 
easily adaptable to the discrete as well as the continuous spectrum. At the 
end of Section 1 the Wigner-Brillouin and the Rayleigh-Schrédinger 
perturbation expansions are obtained as special cases. Section 2 applies this 
general procedure to the continuous spectrum and results in the Lippman-— 
Schwinger equation. 


VIII.1 Perturbation of the Discrete Spectrum 


This section will develop the general procedure called perturbation theory 
for the evaluation of the eigenvalues E, and eigenvectors |A), 


H|A> = E,|4), (1.1) 
of an operator 
H=K + V, (1.2) 


which differs by a small perturbation V from an operator K whose spectrum 
€, and eigenvectors |a>, 

K|a> = €,|4), (1.3) 
are known. To have a concrete case in mind, one may assume that H, K, and 
V are the operators of Section VII.1, that 


[A> = |Alm), |a> = |nIim) 
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for any fixed values of I, m (which is possible, since [L;, V] = 0), and that! 


(mca)? _—m,e* 1 


oe = — = : 
acaer 2m,n? 2h? n? 


In this kind of perturbation-theoretical treatment it is assumed that to 
every eigenvector |a> of K there corresponds an eigenvector |A(a)> of H 
such that E,,,) is given by €, plus a small correction term, i.e., such that if V 
is zero, then E,.,, = €,- In the case that €, is degenerate (i.e., that there are 
other quantum numbers y besides a or A), E44) will in general depend upon 
these additional quantum numbers even though €, does not. 

Although at the moment we have in mind discrete values for ¢€, and a, 
we do not want to exclude the possibility that K has a continuous spectrum ;7 
in that case the summation )’,, in the equations of this section, is to be re- 
placed by an integration over the continuous spectrum (cf. Section VIII.2.) 

We define the operator 


G(E) = (EI — H)™! (I = unit operator), (1.4) 


which depends upon the parameter E. G(E) is called the “operator for the 
Green’s function for the Hamiltonian” by the physicists, and is called the 
“resolvant of the operator H” by the mathematicians. It is (intuitively) clear 
that the task of finding the eigenvalues E, of H is equivalent to that of finding 
the “singularities” of G(E). 

We write G as the product of two operators 


GC — iq. (Gl), 
where 
g(E) = >) |a><a|G,CE), (1.6) 
G,(E) = <a|G(E)|a>, (1.7) 
and F is defined by (1.5). If (1.5) is inserted into the identity? 
(E — H)G(E) = I, (1.8) 
one obtains 
(E — K)Fg = 1+ VFg = 1+ Rg, (1.9) 
where the operator R(E) is defined by 
R = VF, (1.10) 


and its diagonal matrix element is denoted by 


R, = <a|VF|a). (1.11) 


' We revert to the usual cgs units in this chapter. 

? The following treatment of perturbation theory is based on the Goldberger Watson Colli- 
sion Theory (1964, Section 8.1). It has been chosen because it displays the connection between 
the cases of the discrete and the continuous spectrum. 

3 E stands for E- J; the unit operator in a product with a number will be omitted. 
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R is called the level-shift operator and R, the level shift, for the following 
reason: If one takes the diagonal matrix elements of (1.9) one obtains 


(E — €,)<a|F|aG, = 1+ R,G,. (i) 
It is easily seen from the definitions (1.5) and (1.6) that 
Sar ay — 1 GRD) 
Consequently, 
1 
G,(E) = ———_—____.. : 
OS lee (5) (1.14) 


As the eigenvalues E, are the singularities of G(E) and consequently of its 
matrix element G,(E), one obtains from (1.14) 


E, =e, + R,(E;). (1.15) 


Though (1.15) is valid for any E, and €,, it is used in practice to determine 
that value of E, which is “closest” to €,, i7E,(,). Thus R,(E,) represents the 
level shift from the “unperturbed” energy value e, to the corresponding E, (4). 

As already stated above, the assumption* that forms the basis of this 
perturbation theory is that to every |ay)> there corresponds a |A(a)n> 
(i.e., there are as many |an) as there are |A(a) n>), but E,(q) may be a function 
of 7 because R,(E,) does in general depend upon n—e.g., in the example of 
Section VII.1 


R,(E,) = <a|R(E,)|a> = «nl m|R(E,)|nIm)> (1.16) 


—and will in general depend upon n, /, m. In the particular case [R, L;] = 0 
it follows that R,(E,) does not depend upon m, but it will depend upon / in 


*+This assumption, which underlies perturbation theory, can be shown to be fulfilled for 
certain V, as the following theorem states: 


Let 
H = H(k) = K+ KK + 4?K® +... 

with 

(k) M 4 

[KM || < ai (Fl + ||Hfll) (kK =1,2,...), 
where the numbers M and r are positive constants. Then for |x| < r the eigenvalues E(x) of H 
can be written 
E(k) = E® + KE 4 «?E@ + .---, 

and the eigenvectors | E(x)> of H can be written 

|E(K)> = |B) + KJEM) + 17 [ E> + ---. 
For x — 0 it follows that 


Hk 
E(k = 0) = E® = «,, 
|E(K = 0)> = |E® = |a). 
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addition to its dependence upon n. Thus all the |a7> belong to one energy 
level €, of the unperturbed system, and under the influence of the perturbation 
this level splits into sublevels E,,.,, and the state |an> of the unperturbed 
system goes into the state |A(a) > of the perturbed system. 

The level-shift operator R, and thus R, and E,, are evaluated by various 
sequences of approximations. These various sequences are obtained in the 
following way: (1.9) can be written 


(E — K — O)Fg = 1+ (V — O)Fg, Ga?) 
where O may be any operator. We restrict O to be of the form 
O(E) = ), la) <al0,(E), (1.18) 


where O,(E) are numbers (i.e., O is assumed to have the property [O, K] = 0). 
From (1.17) one obtains 


: ae (V — O)F. (1.19) 


er =K—o0, 2 RO 


This equation is not defined on every vector, not even on every vector |a), 
for any value E. That is, the first term is only defined on those |a> and for 
those E for which 


(E 7) O,(E))G,(E) # 0. 


As discussed above, to every eigenvector |a> of K there corresponds an 
eigenvector |A(a)> of H. The operator F(E) has been constructed such that 
F(E,(q)) is the operator that transforms |a)> into the corresponding |A(a)). 
To show this we calculate F(E)|a> at the value E = E,,,,. Applying (1.19) 
to |a> and making use of (1.6) and (1.14) gives 


| 


NE aero E) 


(E — «, — R,(E))|a> 


I 

HE ELK O(E) O(E))F(E)|a> (1.20) 
O,(E) will be chosen such that this expression becomes the simplest possible 
relation for F(E,,,))|a>; this can be done if at the value E = E ayo P(E ya) |@> 
is written as |a> plus a correction term. Thus O,(E) is to be chosen so that 
the first term becomes |a). 

Two choices of O(E,4)) lead to particularly well-known perturbation 
expansions: 


OE xa) = RAE xa)|4> <a, (1.21) 
or 
OAE ya) = Ry(Ex@)Oa'a; (1.21a) 
and 
O(E 4a) aa R,(E ja) |a> (ala (Ex =) 3 la’><a'|, (1:22) 
ata 
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or 
O{E xa) = REx) OAE yay) = Eqs) — €g fora’ #a. (1.22a) 
In the case (1.21), 
(E — K — O(€) p= 24,2, = 14> <a\(E — €g — RAE) 52 Exe 
+ 2) a’><a'\(E — €y)e=eyq3 (1-23) 
and in the case (1.22), 
(E — K — O(€)) 52k, = la><al(E — €, — RAE) )e* Bay 
a d la <a (ene) (1.24) 


Thus in both cases (1.21) and (1.22) the first term on the left-hand side of 
(1.20) becomes |a)>. Inserting (1.23) into (1.20) gives 


F(E ja) |@> =|a> + la><a|(V — O(E ja) )FCE yay) |@> 


1 
E xa) eb gt == RE yay) 
1 
oe Ee 87 ANY — OE a) FE x) 14>. (1.25) 
a’ #a *i(a) €q' 
The second term on the right-hand side is zero because 
<a|(V — O(Eyq))F(Exa)|a> = 9, (1.26) 


which follows from (1.21), (1.10), and (1.13). 
Inserting (1.24) into (1.20) gives 


1 
E xa — RAE xa) 


F(E ja) |@> = |a>+ la ><a — O(E ja) F(E yay) |@> 


ae (1.27) 


a’ #a €a 


The second term on the right-hand side is again zero because again (1.26) 
holds as a result of (1.22), (1.10), (1.13). Thus the cases (1.21) and (1.22) both 
lead to a very similar form for the equation of F(E,,,))|@>. It is now easily 
established that 


HF(Exq@)|a> = E yah (Ea) |a>- (1.28) 
Multiplying (1.25) by (E44) — K — O(E,(q))) and using (1.21) gives 
(Exa — k= O(E xa) F (E ya))|4> 


Exa — a’ 


Lae Ex = 


a’) <a'|(V — O(E jay) )F(E xa) | >. (1.29) 


= 


212 Perturbation Theory 


In the )”, it is irrelevant whether the term with a’ = ais included or excluded, 
because of (1.26). Consequently (1.29) reads 


(Ena — K — O(E ya) F(Eua)la> = (H — K — O(Exa))F(Ex@)|a>, (1.30) 


which establishes (1.28). In a similar way (1.28) can be established for (1.27). 
Equations (1.25) and (1.27) can be simplified. Inserting (1.21) into (1.25) 
gives 


F(Ey@)|a> = |a> + aS is ><a’ |VF(Eya)|a>. (1.31) 


a’ ta Lag = 


This is an equation for F(E,,,))|a>, and can be iterated by successive substi- 
tution of the left-hand side into the right-hand side: 


F(Eyq@)|a> = |a> + x phe en ><a'|V a> 
a’ #a *“A(a) 
1 1 7 i, " wt 
+» i la’) <a’ |Vlia"><a"|Via> +--+. 
aa A(a) Eq’ Aa) Eq” 


(1.32) 


The level shift (1.10) and therefore, by (1.15), the eigenvalue of H is obtained 
from (1.32) as 


Ex = a + a|V|a> + LE eee <a|Vla’><a'|V|a> +--+. (1.33) 
This provides an equation for E,,., in terms of the unperturbed energy value 
€, and the matrix element of the perturbation V between the unperturbed 
states |a>. Equations (1.32) and (1.33), derived from the choice (1.21) for 
O(E xa)), constitute the Wigner—Brillouin form of the perturbation expansion. 

The choice (1.22) of D(E4(q)) leads to the Rayleigh-Schrédinger perturba- 
tion series. Inserting (1.22) into (1.27) leads to 


F(E xq) Ia) = la) +S —— |a)<al(V — SE\F(Ey@)Ia>, (134) 


where OE is defined to be the difference between the perturbed and the un- 
perturbed energy: 


OE = Eya — Ca: (35) 
From (1.34) one obtains with (1.10) and rie an equation for 6E: 
OE = RE ya) => ROE + €,)- (1.36) 


We iterate (1.34) and (1.36) to the first and second order, respectively. We 
denote by Af” the nth-order approximation of dE, and by F® |a) the nth- 
order approximation of F|a)>. In zeroth order, 


A® = EO _¢ =0 Fig) =a), (1.37) 
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In first order, 


AW = R(AI® + €,) = Ra(e.) = <a|VF|ay = <al|Vla>, (1.38) 


1 
FYlay =|a>+ ¥ : la’><a'|(V — AD) F™ | a> 
a#a“a” “a’ 
l I I 
ee le, a) a> (1.39) 
a’ #a &q — Eq’ 


In second order, 


Ay) = R,{Ag” + €.) = <a| VF™ a) 


<a|V \a') <a'|V |a). (1.40) 


= <a|Vl|a> + y : 


a’'#a &a — €q’ 


VIII.2 Perturbation of the Continuous Spectrum— The 
Lippman-Schwinger Equation? 


In the previous section it was nowhere stated that €, is a discrete eigenvalue 
and |a) a proper eigenvector of K, though all equations were written as if 
this were the case. The assumption of a discrete spectrum is in fact un- 
necessary; we shall now discuss in more detail the case of the continuous 
spectrum, which, with the tools of Section II.8, is easily obtained by replacing 
the proper vectors |a> with the generalized vectors, and the discrete sums 
by integrals. 

For the following discussions, we have to make some assumptions regard- 
ing the properties of the operators K and H = K + V. We shall assume that 


spectrum K C spectrum H. (21) 


This is a general enough assumption to incorporate the situations that occur 
for physical problems. Usually for scattering and decay processes K has 
only a continuous spectrum that is positive; the continuous spectrum of H 
agrees with the continuous spectrum of K, and in addition H has a set of 
negative discrete eigenvalues (and sometimes also positive discrete eigen- 
values lying in the continuous spectrum) that are bounded from below. 

In this case of a continuous spectrum the level shift R, is not an observable 
quantity. However (1.15) still holds, for (as remarked in the previous section) 
it is true for any value E, in the spectrum of H and any value, in the spectrum 
of K. 


5 The results of this section will not be used until Chapter XIV. 
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We shall consider (1.20) at the value E,,4) = €,, 1.¢., we shall calculate 
F(E y(a))|@> for that value E44) € spectrum H which is identical to the eigen- 
value €,€spectrum K corresponding to the generalized eigenvector |a)>. 
Such a value E,,,,) exists in the spectrum of H because of the assumption (2.1). 
We call this value E,,,, = E,. For E, (1.15) becomes 


Ie See). (2.2) 


If we again choose (1.21) for the operator O(E,) we will again arrive at (1.31), 
which in the continuous form is written 


F(E,)|a> = |a> + fags me ><a |VF(E,)| 4). (2.3) 


*{ means the principal-value integral, i.e., the integral over all a’ with the 
omission of an “infinitesimally small interval” around E, = E,. da... is 
a symbolic way of expressing summation over the discrete label and inte- 
gration with respect to a suitable weight function over the continuous label. 
The label a in |a> stands for the energy E, and some additional labels n,, 
which may be discrete or continuous: |a> = |E,n,>. If the |a> are “normal- 
ized” so that 


Cn EE ene> = p- \(E,)O(E, = Bone. 
then 
[da=¥ [ oeae, 
na 
[cf. (11.8.24) and Chapter XIV]. Again (1.28) is fulfilled, ie., F(E,)|a> is a 


generalized eigenvector of H with eigenvalue E,. This can also be seen directly 
from (2.3): Multiplying (2.3) by (E, — K) and using 


K|a> = E,|a) 
and 


| aa’ a’ <a Sp iexcal, 


one obtains 


(I — |a)<a|)(H — E,)F(E,)|a> = 0. (2.4) 
In the case that we have (2.2) and (1.21), Equation (1.26) becomes 
(a|(H — K)F(E,)|a> = 0. (1.26) 


This together with (2.4) leads to 
(H — E,)F(E,)|a> = 0. (1.28’) 
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We now make use of a well-known relation from the theory of generalized 
functions, ° 


1 1 i, 
0 ong ait = ae + ind(x). (2.5) 
Then (2.3) can be written 
1 
F(E,)|a> = i) TF 
(Ejay = |a> + | da =—— 1a’) <a |VFLE, a) 


2 e | aa O(E, — E,,)\a’><a'|VF(E,)|a>. (2.6) 


Integrating and using (2.2), we see that the last term in (2.6) vanishes: 
tin|a)<a|VF(E,)|a> = +inR,(E,)|a> = 0. 


Because K|a’> = E,,|a’>, Equation (2.6) may be written in the more usual 
form 


F(E,)|a> = |a> + V(F(E,)|4)). (2.7) 


Ee ko 10 


This is called the Lippman—Schwinger equation. It is an equation for the 
generalized eigenvectors 


|a> = |a(a)> = F(E,)|a> (2.8) 


of the operator H with eigenvalue E,; E, is equal to the eigenvalue ¢, of K 
belonging to the generalized eigenvector |a>. Thus 


H\a> = H|a(a)) = E,|a>  K\a> = E,|a). (2.9) 
The Lippman-Schwinger equation is often written 
1 
E, -K+i0 


|a*> = |x*(a)> = |a> + V lam >: (2.7') 


Besides the solutions of the Lippman-—Schwinger equation |«* >, there are 
other basis vectors of interest. These are obtained from the form (2.3) of the 


© The proof of (2.5) may be found in Gelfand et al. (1964, Vol. 1, Chapter I, Section 2.4). 
Further discussion of generalized functions of the type (x + i0)* may be found in Chapter I, 
Sections 3 and 4 of the same volume. Relations between generalized functions (distributions) 
like (2.5) can be understood in the conventional sense when they are multiplied with a well- 
behaved function @(x) and integrated. Thus (2.5) means 


1 y 1 
fax d(x) = fax — (x) + in fax O(x)p(x) 
x 


x + 10 


for all well-behaved functions @(x). Analogously, relations between generalized eigenvectors 
like (2.6) can be understood in the conventional sense when the scalar product is taken with a 
well-behaved vector ¢ € ® (cf. Section II.10). 
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integral equation for F(a)|a>, which in operator form is written as 


|e 
F(E,)\a> = |a) + ———, VFCE,)|a), (2.10) 
E,—K 
where P indicates that the principle-value integral is to be taken. To dis- 
tinguish the solution of this equation from the two solutions of the Lippman— 
Schwinger equation we denote it by |a”> and write (2.10): 


| oc? => |a> ar —— ViaP>. (2.10’) 


a 


Problems 


1. Let the energy operator be given by 
H=K + aQ° + BQ*, 


where 


(anharmonic oscillator). 
(a) Calculate the eigenvalues to first order in the perturbation expansion. 
(b) Calculate for 6 = 0 the eigenvalues of H to the second order of the Rayleigh- 
Schr6dinger perturbation series. Compare the result with that of (a). 
(c) Calculate for B = 0 the eigenvalues of H to the second order of the Wigner- 
Brillouin perturbation series. Compare the result with those of (a) and (b). 
2. According to Equation (VIII.1.20): 


1 
F(E)|a> = E-K-o0B — KK RAE) a> 


1 


42 P2k Op) (V — O(E))F(E)\a), 


which for O,(E) = R,(E) can be written 


i 


Further according to Equation (VIII.1.28) is for E = E,,,) the vector F(E)|> an eigen- 
vector of H: 


AF(E ya) |a> = ExaF(Eya)la>. 
Consequently the above equation becomes at E = E, 


1 
F(E,)|a> = ja) + ES om (E, — K — O(£,))FE,)|a@> 


= |a> + F(E,)|a>, 


from which it follows that |a> = 0. Give your comment of the above arguments. 


3. Show that [Equation (1.28’) 
HF(E,)|a> = E,F(E,)|a), 
where according to Equation (VIII.2.3) 


1 
ESE, 


P 
F(EJ\a) = lay + fda la’) <a! | VF(E,)|a) 


is the continuous-spectrum analogue to (VIII.1.31). 
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CHAPTER IX 


Electron Spin 


In Section IX.2 the doublet finestructure splitting of one-electron atoms is 
explained as a spin effect. To obtain quantitative results the interaction 
Hamiltonian for the spin interaction is determined in Section IX.3. In Section 
IX.3a the magnetic moment of the electron is determined by classical argu- 
ments; in Section I[X.3b the magnetic field, which acts on the electron 
magnetic moment in an atom, is presented. In Section IX.4 these results 
are used to calculate the finestructure splitting. In Section IX.5 selection 
rules for dipole transitions are derived, and the chapter closes with some 
general remarks concerning the visualization of quantum systems. 


IX.1 Introduction 


The existence of electron spin was suggested by the fine structure in atomic 
spectra. (See the description of the experimental situation at the end of 
Chapter VI.) The electron spin cannot be expressed in terms of the position 
and momentum operators of the electron. If the electron is considered as a 
physical object with translational and rotational degrees of freedom, then 
spin is an observable that corresponds to the rotational degrees of freedom 
in the same way that momentum corresponds to the translational degrees 
of freedom. Thus the electron is an elementary rotator (Section V.1) with a 
translational degree of freedom. In the preceding chapters we have ignored 
the rotational degrees of freedom of the electron, as their contributions to the 
energy of the electrons bound in atoms is small. We have shown in Chapter III 
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Figure 1.1 Splitting of a particle beam after traversing a Stern—Gerlach 
magnet. 


that angular momenta of any integral and half-integral value may exist and 
that there is nothing peculiar about haif-integral angular momenta. It turns 
out that the spin (intrinsic angular momentum) of the electron is 4. 

A direct verification of the electron-spin hypothesis came from the Stern-— 
Gerlach experiment. The experimental arrangement is shown in Figure 1.1. 
A strongly inhomogeneous magnetic field was produced between pole 
pieces P,; and P,, of which P, has a sharp edge. A beam of hydrogen atoms! 
in the ground state was sent closely past the sharp edge of pole piece P,, 
eventually hitting plate 7. With no field the beam formed a narrow line on 
the plate (Figure 1.1 dashed line). When the magnetic field was turned on 
the line divided into two lines. (Figure 1.1 solid lines). 

Without going into the details for the description of this experiment (see 
Chapter XIII), one can already see that the model of the hydrogen atom 
described in Chapter VI cannot explain such a separation. According to that 
model the ensemble of hydrogen atoms with lowest energy is in a pure state 


W = Naa=1); (1-1) 


because A(n = 1) is a one-dimensional space. According to the model of 
Chapter VI, it would thus be impossible to separate the ensemble of the 
original beam into two subensembles as has been done by the magnetic 
field. The experimental separation into two subensembles shows that in- 
stead of A(n = 1) we must have (at least) a two-dimensional space. If we 
assume that the ensembles of the separated beams are in pure states, then 
we must have an exactly two-dimensional space. Within a certain degree of 
accuracy (including only the electron structure and disregarding possible 
structure of the nucleus) this assumption has so far been upheld. 

As the separation of the beam was caused by a magnetic field, one con- 
jectures that the new observable, whose existence has been demonstrated 
by the Stern-Gerlach experiment, must be connected with the magnetic 
moment. As the magnetic moment is connected with rotating charges, one 
would suspect that this new observable must be angular momentum. 


! The original experiment used silver atoms. The experiment has since been repeated with 
other atoms, including hydrogen. 
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1X.2 The Fine Structure— Qualitative Considerations 


The angular momentum that causes the separation of the beam in the 
Stern—Gerlach experiment cannot be the orbital angular momentum L; = 
€:j;,Q;P,, because this is zero for the ground state of the hydrogen atom. 
Furthermore, there are no two-dimensional spaces of states for the algebra 
of orbital angular momentum as | = 0, 1, 2,... (Problem III.1). The two- 
dimensional representation space of the algebra of spin angular momentum 
is #)='/? (Section II1.3), containing the two one-dimensional spaces of 
states with opposite helicity. Denoting this two-dimensional space by 2°, we 
may write 
a= RN” Gy Bie! ®@ Me yo (2.1) 
We thus conjecture that the electron is an elementary rotator and the 
electron in the hydrogen atom is a combination of the physical systems of 
an orbiting electron and this elementary rotator (spinning electron), the 
space of states of which we call 2°. Then the space of physical states of lowest 
energy of the hydrogen atom is 


wn 1) — ain 1) 3" (22) 
and in general the nth space of the hydrogen atom is given by 
Hn) = Rn) ® *. (2.3) 


The orbital observables (functions of Q;, P;) A; and L;, given by (VI.3.1), 
(VI.3.3), and (VI.3.9), act in the space #@(n), and the new angular momenta, 
which are called spin and denoted by S,, act in the space 2°. 

To check whether our conjecture is in agreement with experiment, we 
reduce the space #(n) with respect to angular-momentum spaces. According 
to (VI.4.4), each space A(n) is the direct sum A(n) = S723 @ #'. Then 
HA (n) given by Equatioh (2.3) is the direct sum 


nl nal 
HE (n) = » ® a") @*= Y @(#7'@r) (2.4) 
L; \l=0 l=0 

R' ® 2° is the space of physical states of the combination of two elementary 
rotators with angular momenta / and s = 5. The orbital angular momenta 
L; = €j.Q;P, act in Z', and the spin angular momenta S, act in »*. The total 
angular momentum of the combined elementary rotators is then, according 
to Equation (V.2.3), 


J,=L,@1+1@S,. (2.5) 


Using (V.2.38) we can reduce #! ® ** to the sum of total-angular-momentum 
eigenspaces 


Eta if | = 0, 


Ri="* 112 & Qi='- 12 otherwise. Cc 


weav=| 
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Whether the #/ or the #' ® »' are the spaces of physical states depends, of 
course, upon the preparability of these states. If physically preparable states 
are eigenstates of the total angular momentum, then the #/ are the spaces 
of physical states; if the physically preparable states are eigenstates of the 
orbital and spin angular momentum, then the #' ® *° are the spaces of 
physical states. The experimental data show that the physical spaces are 
the #’, for each orbital-angular-momentum level (except the | = 0 level) 
splits into two sublevels as required by (2.4) and (2.6). 
From (2.4) and (2.6) it follows that 


HA (n) = Ril? 0) © (Ril2 ® RG? 1) ® (Riz @ Rel? 1) 
@---@ (Mian2 1) @ Mian). (2.7 
Forn =|, 

H(n => 1) = Ril 0), (2.8) 
and there is one energy level corresponding to this two-dimensional space. 
The two-dimensionality of #(n = 1) allows us to explain the separation 
of the beam in the Stern—Gerlach experiment into two subensembles, one 


corresponding to the spaces #fj'2o) ;,= + 1/2 and the other to Bi/2o) j= -1)2- 
Form = 2, 


HK (n = 2) = Rien @ Wea) e) Ril? 1), (2.9) 
and there are three energy levels, each of which corresponds to one of the 
spaces in (2.9): 

Ruz 9) <> 25? Bil2,.<>2p'/? and Bi? 1) 2p*”?. (2.10) 
Forn = 3, 
Hn = 3) = Rizo, ® Fiz 1) O Be2 1) B Biz 2, @ Riz. (2.11) 
which belong to the energy levels 
35 1/2 ape Bp ol 3q3/2 3q5/2 


respectively. For n = 4, 5,... the same arguments hold. These results reflect 
precisely (if hyperfine splitting is neglected) the experimental data shown 
in Figure VI.5.2 for hydrogen and Figures 2.1, 2.2, 2.3 of Chapter VII for 
the alkali atoms. Thus to each energy level there corresponds an eigenspace 
of the total angular momentum; eigenspaces of J* are also eigenspaces of 
H, and 

(H, J;] = 0. (212) 
This equation which we obtained here from the empirical facts, expresses the 
rotational invariance of H and can theoretically be obtained from general 


symmetry arguments. 
We shall denote by #° the space # given by (VI.5.2): 


HH = Y © An). (2.13) 
n=1 
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The operator H given by (VI.3.2) or (VL.3.5) is not the total-energy operator 
when spin is taken into account; let us rename this approximate energy 
operator H,: 


EE a 
1 (2.14) 
(or 
Pp? 
Ho => - o + V(Q) (2.14’) 


for the alkali atom). The space of physical states of the hydrogen atom ina 
description that takes spin into account is then 


fe. 9) 


H =H Oe = (s ®@ a) on - Hn). (715) 


n=1 


This description is the combination of the spinless hydrogen atom and an 
elementary rotator. A basis system in this space is thus 


Inim> ®|s =4s3), (2.16) 
where the 
mim) (n =1, 2, 3.95. 3) — 0, 1, 2, nn — ben ee 
form a basis system for #°", and the |s = 453) (s; = —4, +4) forma basis 


system for 2‘ = '/?. Each observable A in ¥ is written, according to the 
basic assumption IV, 


A=) Avy ® AG", (2.17) 
ij 


where the fj” act only in #°° and the Af?" act only in * [cf. (III.5.7)]. 
Orbital angular momegtum and spin are special examples of this: 
L, = Li? @ PP" = (eg OF? Pe?) @ PP" = L, @ 1 = (Ci OjP) @L. 
(2.18) 
S,= 1°! os i 5; (2.19) 
(For typographical convenience we shall usually omit the superscripts 
“orb” and “spin” if it is clear from the context in which space an operator 


acts.) The total angular momentum of the combined system is the sum of 
these two angular momenta, 


J,=K,+8,=L,@14+1@5S,. (2.5) 


The energy operator H can also be written in the form (2.17). Since Hy @ I 
already gives a very good approximation to the energy spectrum (cf. Figures 
VI.5.2, VII-2.1, VII.2.2, and VII.2.3), we shall write the energy operator as 


H=H +H, (Ho =H, @)D), (2.20) 
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where H, acts on the entire space # = #° @ + and is small in the sense 
that its contributions to the energy values are small compared with the 
values of Hy. H, cannot be of the form H, @ I. If it were, no fine-structure 
splitting between levels with the same / could occur, for H%‘° could just be 
added to H@'° and the resultant H = (Hy + H,) @ / (for a given value of J) 
would only lead to a shift of the energy levels. Neither can H, be of the form 
I @ H,. For # is a very simple two-dimensional space: Every operator in 
2° can be written as a linear combination of the four operators I, S,, S,, and 
S,; thus H, would have to be H, = 0°! + ¥3., a!S;(a'e€C). The first 
term would only lead to an overall shift of all energy values. The second 
term ()?_, a‘S;), and consequently H,, does not commute with J,, because 


Le S;] = i€; i, Sx (221) 
as a consequence of (2.5) and of 
[S,, S;] = 1€; j, Sy (2.21a) 


Thus it would follow that H, and consequently H = Hy + H, is not rota- 
tionally invariant, in contradiction to (2.12). Hence H, must be of the form 
(2.17). Now the most general possible form of H, is 


3} 
H,=A@QI+ > B,@S,, (2.22) 
i=1 

where A and B, are operators in #°, i.e., functions of the operators Q,, P;. 
A is a scalar operator, while the B; are the components of a vector operator 
with respect to the L;. The former follows from the fact that every operator 
in 2° = A} can be written as a linear combination of the operators J, S,, 
S,, and S,. The latter follows from the requirement that 


[H ud il = 0. 
which in turn follows from (2.12) (2.20) and [Hy, L;] = 0. 
To see this we calculate 


3 


> [B, & Sk; Ji] aa [B,. Lj] ® Si, ap 1€,;; By & Sj, 


k=1 
which is zero only if 
[B,, Lj] = i€;;B;, (2.22b) 
i.e., if B, is a vector operator with respect to the L;. 
As 
[H,, Lj] #9, (2.22a) 


which follows immediately from (2.22b), the basis vectors given by (2.16) 
cannot be eigenvectors of H. Physically preparable states appear to be 
always energy eigenstates or mixtures of energy eigenstates (cf. Chapter 
XII). Therefore we should use a basis of eigenvectors of the energy operator 
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H. Because of (2.12), eigenvectors of the total-angular-momentum op- 
erators J*, J, can be eigenvectors of H. Therefore we use (V.2.12) to form 
the new basis vectors 
Inis =3jjs> = y |nJl;> @|s =453><I135 =483lfjs>, (2.23) 
1383 
where </l;453|jj3> are the Clebsch-Gordan coefficients given in Table 
V.2.1. The basis vectors (2.23) are eigenvectors of the c.s.c.o. 


Ho. L S*. es J; C (2.24') 
As S? is 4(4 + 1) on the whole space .¥, we can ignore it and have in 
gale eae (2.24) 


ac.s.c.o., of which the basis vectors of (2.23) are eigenvectors. The basis (2.23) 
need not be a basis of eigenvectors of H. If the basis vectors (2.23) are not 
eigenvectors of H, then eigenvectors of H can be obtained from (2.23) by 
perturbation-theoretical calculations that affect only the quantum number 
n [cf. the remark on perturbation theoretical calculations preceeding Equa- 
tion (VII.1.4)]. The reason for this is that the system 


i, Lede ds (2.25) 
is also a c.s.c.0., the eigenvectors of which we call 
|E ljjz>. (2.26) 
The difference between (2.23) and (2.26) is that 
Ho|nljj3> = E,|nljjs> (2.27) 
where E° = —a/(2n7), and 
~ iE ljj,>— BE 1333) (2.28) 


where E is a yet to be determined value. Thus the basis systems (2.23) and 
(2.26) have all but one quantum number in common. 
To prove that (2.25) is indeed a c.s.c.o. one needs only to show that 


(Lt, Lo (2.29) 


This cannot be derived from the general form (2.22), but constitutes a 
condition on the A and B;. The justification of (2.29) comes from the con- 
siderations in Section V.4 on parity, also expressed by (VI.5.9), and a new 
assumption about H: 


[H, Up] =0, or UpHU;! =H. (2.30) 


This assumption is called parity invariance (of H). Up in the spin space is 
given by 


Up|s = 7 53> = Ms|s = 4 53), (2:31) 
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where |z,| = 1. Equation (2.31) is (V.4.20) for the special case that j =s = 4 
and follows from (V.4.3) for the S;, i.e., from 


UPS, os = S;. (2232) 

Equation (V.4.20) and (V.4.21) for the orbital angular momentum (j = J) 
give 

Upleviy=— (— 1) noale tia): (2.33) 


here € may be any additional quantum number whose operator commutes 
with Up,e.g., € =n or € = E. From (2.31) and (2.32) it follows that 


Up|olls> @|s = 383> = (—I)'n|E1l3> @|s=453>. (2.34) 
(yn = Now»). It then follows from (2.23) that 


Urleljjs> = (—D'nlE ljjs>. (2.35) 


Consequently the eigenvalue of L? is connected with the eigenvalue of Up, 
and the quantum number / in the basis (2.23) may be considered the parity 
quantum number. [The parity can have only the two values + 1 and —1; 
thus for given n, j, j, there exist two states of opposite parity. This has already 
been expressed in (2.7) in the form that to every j there belong two I’s, | = 
j+4and!=j — 4.] Thus if (2.30) holds, € in (2.35) may be E, the eigenvalue 
of H; from (2.30) it follows that E, |, j, j, may label the states, and conse- 
quently that (2.25) is a c.s.c.o., one which has now been shown to be identical 
with the c.s.c.o. 


Hep. J, 35. (2.36) 

Now that (2.29) has been established, it can be used to obtain some new 
conditions on the operators A and B; in (2.22). From (2.22) follows 

Dee peae G27) 


which does not give any new restrictions on A, but, along with (2.22b), does 
restrict the B;: 


0 = [B,, L7] = {L,, [B,, L,]} = ie (Li, B;}. 
From this follows the restriction 
B, = f(Q, P)L;. (2.38) 


where [f(Q, P), L;] = 0, ie., f(Q, P) is a scalar operator with respect to 
the L;. We will later determine B; by correspondence from some physical 
considerations with the classical case, and will then see what functions of 
Q and P f is. 

Although we have denoted the basis vectors (2.23) and (2.26) differently, 
it may happen that they are identical (up to a phase). We have to distinguish 
the two possibilities 


[H,, Ho] =0, which implies [H, Ho] = 9, (2.39) 
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-~ [H,, Ho] #0, which implies [H, Ho] 4 0. (2.40) 
If (2.39) holds, then the basis vectors (2.23) are also eigenvectors of H and 
H|nljjzs> = E,y|nljjs>; (2.41) 
the eigenvalue of H may in general depend upon x, I, and j, and is given by 
Ey = ES + eu ER = — 53s (2.42) 
where €,,; 18 the eigenvalue of the operator H, in the state |n1jj3>: 
H,|nljjzs> = €yj|\nlijs>. (2.43) 


If (2.40) holds, then (2.23) and (2.26) are different vectors, and (2.26) can be 
determined from (2.23) by perturbation theory. The first-order perturba- 
tion expression E‘}) for the eigenvalue of H is then given, according to 


(VIII.1.33) or (VIII.1.38), by 


Boy) = Ey ot Gu (2.44) 
where ¢{}) is the expectation value of H, in the states (2.23), 
Enly = <n1jj3|H,|nljj3>. (2.45) 


IX.3 Fine-Structure Interaction 


The calculation of quantitative results requires knowledge of H,; H, can 
be conjectured by some plausible physical arguments from the correspond- 
ing classical situation. The fine-structure term in the energy operator con- 
sists of two contributions: (1) a contribution caused by the interaction 
between the internal magnetic moment of the electron and the magnetic 
field in the electron rest frame due to the motion of the proton charge as 
seen in this frame, and (2) a contribution caused by the variation of the mass 
of the electron with its velocity. Both terms arise from relativistic kine- 
matics; although the last term is of the same order of magnitude as the 
magnetic-moment term, it does not contribute to the splitting of the levels 
with the same values of n and /. Besides the fine-structure terms there are 
other contributions to the energy operator, which cause the Lamb shift and 
the hyperfine structure, but we shall not consider those here. 


IX.3a The Magnetic Moment of a Spinning 
Particle in Classical Physics? 


The energy of a magnetic dipole of magnetic moment m in a magnetic field 
of strength H is 


E® = —m-H. (3.1) 


2 Corben (1968). 
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So we need to know the magnetic moment m of the spinning electron and 
the magnetic field H in the electron’s rest frame. In this subsection we shall 
determine m, and in Section IX.3b we shall discuss H. 

Before we consider the magnetic moment of a spinning particle, let us 
recall the connection between the orbital angular momentum and the 
magnetic moment caused by the revolution of a charged spinless particle 
around a center. According to classical electromagnetic theory, the magnetic 
dipole moment of a point charge at position x with mass m, and charge 
(—e) moving with a velocity v is given by 


m = — (—e)x x v= — 


x el. (3.2) 


1 
2m,c 
(Note that we consider negatively charged revolving particles like the 
electron, and that e = +4.803 x 107 !° esu.) The connection between the 
magnetic moment of a spinning charged particle and its internal angular 
momentum (spin) s differs from (3.2) by a factor of g, + 2, the Landé factor, 
i.e., the gyromagnetic ratio is e/(m,c) and not e/(2m,c) as in (3.2). In order to 
derive this factor we consider the spinning point particle. 

A classical spinning particle is a physical system with two different 
dynamical variables, the momentum p and the intrinsic angular momentum 
s. If x denotes the position, then the total angular momentum j is given by 


f= s—x< < pes: (3.3) 


The external force F acting on the particle is given as the time rate of change 
of the momentum of the particle, 


dp 
F = — 3.4 
Ht (3.4) 
and the moment M about the origin of the forces acting on the particle 
is defined as 


Mix er (3.5) 


the first term representing the orbital contribution to this moment from the 
resultant force F, and the second term describing an extra applied torque 
T. This torque may arise, for example, from the effect of an externally ap- 
plied magnetic field upon any intrinsic magnetic moment the particle may 
possess. We now postulate that 


(3.6) 


so that, from Equation (3.3), (3.4), (3.5), and the definition v = dx/dt of the 
velocity, 


FvxpaT. (3.7) 
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If the momentum p and velocity v are parallel, or if either of them vanishes, 
then 


pak = (3.8a) 
and 


oF aaa F. (3.8b) 
Spin and orbital motions are then separately and respectively determined 
by the torque T and the moment of the force F, so that the spin and orbital 
motions become uncoupled and each can be discussed without reference 
to the other. Since in nonrelativistic mechanics p = my, it is possible in this 
approximation to discuss the orbital motion in terms of (3.8b) and the spin 
motion in terms of (3.8a). For example, in the equation describing the 
precession in a magnetic field H of a particle that possesses a magnetic 
moment m, the torque is 


— mo TL: (3.9) 
consequently 

ds 

a my x EL (3.10) 


Let us now assume that we have a particle that has no intrinsic magnetic 
moment, i.e., m = 0; consequently, by (3.9), T = 0, where T is interpreted 
as the torque acting on any intrinsic moment the particle may possess. But 
let us assume that v x p # 0, ie., that we do not have the strictly non- 
relativistic relation p = mv or the relativistic relation for free spinless 


particles p = mv/(1 — v7/c*)'/”.° Then (3.7) becomes 
de BRU 
—— x a 
SF vx Pp, (3.11) 


Suppose the particle ts moving in a uniform constant magnetic field H. 
The Lorentz force on the particle is then 


F = — = —— : 
Py ; vx H, (42) 
so that 
Midas dp e 


But, using (3.11) together with the supposition that H is a constant field 
(dH/dt = 0), 


ed eds é 
ee -H = —-—_—-, So = 
A ) c dt M ae 
= +<pxv-H= “pv xH. (3.14) 


>y x p 4 0 will occur if the time components s9,. 592, S93, Which together with Siz = € ijn St 
make up the relativistic spin tensor s,,., are not constant in time. This is always the case for an 
observer for whom the particle is moving with a velocity v. 
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Adding (3.13) and (3.14), we conclude 
tp? + “8-H = const. (3.15) 


Hence, supposing the particle possess a constant mass m,, it follows that 


ue ph-H = const, (3.16) 


where 


@ 


i= 5, (3.17) 


mc 
Equation (3.13) tells us that the kinetic energy p?/(2m,) of the particle is not 
a constant of the motion unless two of the vectors v, p, and H are parallel. 
In general, the particle has an extra energy —p-H in the magnetic field H 
[cf. (3.1)], ie., it behaves as if it had a magnetic moment p in the direction 
of its spin, with gyromagnetic ratio e/(m,c). Equation (3.17) is similar to 
(3.2), but with an additional factor of 2 (the value of the Landé factor g,) on 
the right-hand side. We are thus led to the conclusion that all charged spin- 
ning particles automatically possess a magnetic moment, given by (3.17), 
which is of relativistic origin. [The sign in Equation (3.12)—(3.15) and (3.17) 
reverses for positively charged particles. ] 

If the electron is a quantum-mechanical particle with charge —e and 
without intrinsic magnetic moment, then its magnetic-moment operator 
should be given by the quantum-mechanical analogue of (3.17), i.e., by 

= S20 = SS (3.18) 
2m,c 


m,.c 


where S is the spin operator. It turns out that (3.18) indeed gives a very 
accurate description of the electron magnetic moment. The deviation from 
the g, = 2 value for the electron comes from the radiative corrections of 
quantum electrodynamics and is of the same order as, and of analogous 
origin to, the Lamb shift. Including these corrections up to the second order 


gives 
2 
Ae) ( 205 = = 0;28(*) ), 
uh “8 


where « = e”/(hc) is the fine-structure constant; this value for g, agrees with 
experimental values up to the eighth decimal! place. Thus electrons ~and 
also muons—are particles without an intrinsic magnetic structure. 

The value g, = 2 was first established as far back as 1915 by an experiment 
of Einstein and de Haas, and was incorporated in the spin hypothesis put 
forward around 1926. The existence of the radiative correction terms was 
first discovered experimentally by Rabi and collaborators in 1947, and was 
calculated within the framework of quantum electrodynamics by Schwinger 
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in 1948. The value g, = 2 is also obtainable from the assumption of “minimal 
coupling to the electromagnetic field,” first used in the Dirac relativistic 
wave equation for the electron and considered one of the great achievements 
of the Dirac equation. The above argument shows that it is already a con- 
sequence of classical considerations. 

There are elementary particles with intrinsic structure; e.g., the proton 
has a magnetic moment 


(g e 


P 


(2 + 3.59)S, (3.18p) 


pe 


i.e., a g-value of g, = 5.59. The value of the magnetic moment in excess of 
the value 2(e/(2m,c))S given by (3.17) is called the anomalous magnetic 
moment; it is 1.79e/(m,c) for the proton. The neutron has no charge, and 
therefore the magnetic moment given by (3.17) is also zero; however it has 
an anomalous magnetic moment 


€ 


M. = 
- le 


(0 = 1.928. (3.18n) 


IX.3b The Spin—Orbit Interaction Term 


The determination of the magnitude of the magnetic field H must take into 
account relativistic kinematical effects. We go into a coordinate system that 
moves with the electron around the proton. In this system the electron is 
at rest, and the proton charge moves with a velocity v that is equal in magni- 
tude but opposite in direction to the electron velocity. This movement con- 
stitutes a current. The magnetic field caused by a current of a single charge 
+e moving with a velocity v is, according to the Biot-Savart law, given by 


Mi oS dS 


H(x) = +e (3.19) 


(e is measured in esu, H in gauss), where x is the vector of magnitude r from 
the moving charge to the observation point. The angular momentum of the 
electron is 

li =x x (—m,9). 
Consequently the magnetic field at the position of the electron caused by 
the rotating proton is 


he 


I (3.20) 


m.cr? 


Both (3.19) and (3.20) neglect relativistic effects. From (3.1) we thus obtain 


Ee aio al I. m, (G21) 


mc r 


e 
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for the energy of the magnetic moment in this field. The subscript “rf.” 
denotes that we have used a rotating frame. If the frame rotates, there is an 
extra contribution to the energy that reduces (3.21) by a factor of 4. [This 
factor is known as the “Thomas factor,” and is caused by the “Thomas 
precession”; a detailed calculation may be found in Jackson (1975, p. 364). 
Thus the energy of the moving spin magnetic moment in the magnetic field 
of the proton is given by 


e 1 
2m,c r° 


E™ = — I-m,. (3.22) 

We obtain the quantum-mechanical expression corresponding to (3.22) 
by the usual procedure of replacing the classical quantities ] and m, by the 
quantum-mechanical observables L amd M, given by (VI.3.1) and by 
(3.18): 


Hw = + ee L gee a Ea L,S; (usual cgs units) 
PST 9s am? O38? ~ Dime? Qt” . 
1 
= 9.llp Hy LS; (h = 1). (3.23) 
Q 
The quantity 


_ eh — (48 x 107*° esu)(6.6 x 107 *° eV sec) 
MB 3m.c  2(3.0 x 10° cmjsec)(9.1 x 102° g) 


o-8 esu eV sec? cm?/? sec 


= 0.579 x 1 = 0.579 x 10-8 OMS pe 


= 0.579 x 10° ® eV/gauss = 9.27 x 10° *' erg/gauss 


is called the Bohr magneton. The operator L-S of (3.23) is easily calculated 
from (2.5), 


J? =(L+S) =L’? +S’ + 2L-S, 
and the fact that 
S?|nIjj3> = s(s + 1)|nljjs> = alnljjs>. 
We then obtain for (3.23) that (in the usual cgs units) 


2 
Him 12 
2 mic? 


Oe =e), (3.24) 


1X.3c The Kinematical Correction Term 


We obtain the contribution arising from the relativistic mass effect by 
expanding the relativistic expression for the kinetic energy 


UN J(m,c?P + ca ig 
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of the free electron in powers of p/(m,c) [p = (p*)"”7]: 


2 2 4 
2 ee La eee yy Pf? eee ee 
Po=mec~ {1+ S = M,C ( =F 5 (2. 3 (-] + 


2 PRN PS 
ee. Coe! | eee 
ee oe > yoo 


+ 


The rest energy m,c? of the electron is ignored because nonrelativistically 
the energy is determined only up to an additive constant (the rest energy 
would shift the energy levels by the same fixed amount). Thus classically 
the kinetic energy with the first-order relativistic correction is 


pn =P _if(Py_t 
xin" 2m, 2\2m.) m,c?’ 


in contrast to the usual expression 


Ekin =a 


2m, 


Going to quantum mechanics, we replace the numbers p; by the operators 
P,; and obtain 


1 P2 2 1 e2 2 
H® = — >{——} =- >{H)+— 325 
; 2mee" (= Zine ( ° 5 | Ce) 
for the kinematical correction to 
oe 
H, = Sa reek 
. gees ee) ( ) 


[Both this equation for Hy and (3.25) are in the usual cgs units, as opposed 
to the units described above (VI.3.4).] The total-energy operator with the 
two corrections is then 


H =H, + H® + H®, (3.26) 


where H{” and H{? are given by (3.24) and (3.25), respectively. 


1X.4 Fine Structure of Atomic Spectra 


As H, = H{" + H\ contains the operators Q* (k = —1, —2, —3), H, and 
therefore H do not commute with H). It therefore appears that we have the 
situation described by (2.40), (2.44), (2.45), that |n/jj,> are not the physical 
eigenvectors and that E,,; given by (2.44) and (2.45) is only a first approxima- 
tion. Without further justification (which comes from a relativistic theory), 
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we state that the hydrogen-atom energy operator that includes fine-structure 
effects has eigenvectors that are not eigenvectors of the operator H, but 
that are again labelled by n, I, j, and j;, thus the eigenvectors (2.26) are also 
labeled by n. The values E\;) given by (2.44) and (2.45) are identical to the 
eigenvalues of this relativistic energy operator. 

In order to calculate the matrix elements (2.45) we make use of (VII.2.15), 
(VIT.2.16), (VIL.2.17) with (VII.2.20). [Note that for ! = 0 Equation (VII.2.17) 
is singular; for | = 0 the matrix element of j- s is also zero because 


<n Ijj3{L-S|nljjs> x 1/2 


—cf. (4.2) below—so that the matrix element of L-S/Q° is always well 
defined.] We calculate the matrix element of the spin-orbit term H%” and 
find 


2 
ne e ae - 
<nljjs3|HO |\nljjs> = Fnzez <M lislQ *InIjjs> 


" for j = 1+ 4, (4.1) 
—(l+1)/2 forj =1—4, 
because 
HJ? — L? — §1)|nljjs> = 206 + 1) — K+ 1) - Dn Ijjs> 
= 2lnljis) a + 1) 8 " , i ~ 
According to (2.23), 
<nljjslQ-*|nliis> = 2, dL Gislllss = 4 53> (Ils s = 355l iis) 


13's3' 1383 
x <53|85><n11,|Q-?|n 115). 


Using the orthogonality relation (V.2.15) of the Clebsch-Gordan coefficients, 
and the fact that the last matrix element is zero unless /, = 13, which we 
write as 


<n11s|Q-*|n1ls> = 5,,36n 110° *|nD, 
we obtain 
<n1jj3|Q-*|nljjs> = <nl|Q-*|n1). 
Thus from (4.1) and (VII.2.17) together with (VII.2.20) it follows that 


3,6 
ae 1 


Lf i \ om 
oe (m) mo = = = 2 =. a — 
COUUTISIEED es tat Vo » lea} ri ULES)! 
1 {1 forj=1+4, 
e+ 1) forts. 


(4.3) 
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Recalling that 


4 2\ 2 1 
va ee tL fe ee 4.4 
sh t 2p? (=} mee On?” Oe. 


(4.3) can be written as 


eal id+1) forj=1+4, 
c} n2l+t]-t1/l forj =1—4. 


(4.5) 


<nijjs|Hy|nljjs> = -F2/ 


The matrix element of the kinetic-energy term H{? of (3.25) is 
<n lj jz3|HY |nlijs> 


— 5g {EY + 28Qe*nIO™ nly + eXnl19-?[n 1D} 
m,C 


. eo \ceul 2e?_ m,e” 1 e*m? e* 1 
Se ee ee | eres 2 ya a) es 
c} 4n Pics ln 2m,c°E, (1 + 3)n 
e7\2 1 1 l 
Be Df ee a Pe 
ms ze(= } ae ene ass} 


eo les n 
= —F°(—]| — |--——}. 4.6 
Fe (©) =(j 74 eo 
In this calculation (VII.2.15) and (VII.2.16) together with (VII.2.20) have 


been used. Adding (4.5) and (4.6), we obtain for the spin-orbit interaction 
term 


iN? 
re oy al 1 73} n 
. = ee ; 4.7 
(nl jjs|Hsnljjs> E(=) aa (; 4) (4.7) 


Therefore the matrix element of H = Hy, + H, is (with h restored) 


a Mee | Pe eye ear i n 3 (48) 
nj = <n jisl|H|nljjzs> = E, Pg pel AL : 


The term in (4.8) that gives rise to the fine-structure splitting is the one 
proportional to 1/(j + 4). Recalling that «? = (e?/(he))? = (1/137), we see 
that this fine-structure term is four orders of magnitude smaller than E° 
and gives the experimentally correct splitting between the energy levels of 
different j (cf. Figure VI.5.2). Equation (4.8) is independent of | and therefore 
does not describe the splitting between 2S1/?, 2P1/?, 3P?/?, 3D?/?, etc., i.e., 
between states with the same value of j but different values of / or different 
values of parity. As mentioned above, (4.8) is also the eigenvalue of the 
relativistic energy operator, which does not include corrections for the Lamb 
shift and nucleon structure. 
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IX.5 Selection Rules 


Selection rules for dipole transitions have been discussed previously, in 
particular in Section V.4 for general angular momentum states and also 
briefly in Section VI.5 for the hydrogen atom without spin. The selection 
rules, 1.e., the rules that tell when the matrix elements <n/jj,|Q;|nljj3> 
are zero, follow from the property that Q; is a proper vector operator, i.e., 
that it fulfills the relations 


[L;, 0;] = i€; jn Oy, (5.1a)- 

LJi, Q;| = i€; jn, Qk.» (5.1b) 
and 

UpQ;Up = —Q,. (5.2) 


Equation (5.1b) expresses the fact that Q; is not only a vector operator with 
respect to the orbital angular momentum (an L-vector operator) but also a 
vector operator with respect to total angular momentum (a J-vector op- 
erator). Equation (5.1b) follows from J; = L; + S; together with (5.1a). 

As a consequence of (5.1b), the selection rules for dipole transitions between 
the physical states |n1jj3> for the hydrogen atom (and all other one-electron 
atoms) are: 


VF F31Q|nljjs> = 0 (5.3) 
unless j’ = j + 1,j, or j — 1. From (5.2) it follows that 

<n’ l'j’ j3|Qilnljjs> = 0 (5.4) 
unless x(J)x(I') = —1 [i.e unless (—1)'*' = —1, as x(J) = (—1)']. From 


(5.3) and the fact that j’ = I’ +45 and j =! + 3, it follows that the matrix 
element is zero unless I! =/+ 2,/+ 1,1,!— 1,1 — 2. Thus (5.4) can be 
written 


«nlf 73|Q,|nljjs> = 0 (5.5) 


unless I’ = | + 1. Equations (5.3), (5.4) or (5.5) give the selection rules for 
dipole transitions in the hydrogen atom and all other one-electron atoms. 


1X.6 Remarks on the State of an Electron in Atoms 


We should close this part on the hydrogen atom with a remark. To conjecture 
the algebraic structure that is the mathematical image of the hydrogen atom, 
we made use of the classical particle picture of the Kepler problem, -e., 
we had as the point of departure for our conjecture the picture of an electron 
as a particle that moves in closed orbits around a center. And when we 
discussed spin, we even implied some analogy of the spin with the rotation 
of this particle around its own axis [in the comparision of (3.17) with (3.18), 
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for instance]. This was the classical picture, which is described by the mathe- 
matical relations between the observables when they are not represented by 
operators but by numbers. In quantum mechanics these mathematical 
relations are operator relations, and are not the mathematical image of a 
point particle revolving around a center and rotating around its own axis. 
The electron in the hydrogen atom is not a particle, and the spin is not a 
rotation around the particle’s axis. A quantum-mechanical object was called 
a particle when it was approximately a generalized eigenstate of the position 
operator, because localization is the characteristic property of a particle. 
The electron in the hydrogen atom is not in a generalized position eigenstate. 
Equally wrong is the view (which originates from the misinterpretation of 
the solution of the Schrédinger wave equation) that the electron in the 
hydrogen atom is a standing plane wave, i.e., the classical picture com- 
plementary to the particle picture. A quantum-mechanical object was called 
a wave when it was a generalized eigenstate of the momentum operator 
(Section II.11), because this state has the characteristic property of a wave 
(wave motion over all space). The electron in the hydrogen atom is not ina 
generalized momentum eigenstate. Thus neither of the complementary clas- 
sical pictures of the electron is applicable to the electron in the hydrogen 
atom. In the hydrogen atom the electron does not appear as a wave or 
as a particle, but in a form different from both, namely, as an angular- 
momentum and energy eigenstate, in which it has neither a definite position 
(particle) nor a definite momentum (wave), but does have a definite angular 
momentum (rotator). The classical picture that comes closest to this is that 
of a standing spherical wave. 


Problems 


1. (a) Show that every operator on *‘ is a linear combination of the operators 
I, S,,S,, and §,; as stated in the paragraph preceding equation (2.21). 
(b) Check to see whether there are other solutions of 


iin t{L,, Bj} = 0 
[the equation immediately preceeding (2.38)] besides (2.38), 


2. The proton possess a magnetic moment 


M, = ae (2 + 3.59). 

(a) Determine the Hamiltonian that describes the interaction of the electron’s 
spin with the proton’s spin in the hydrogen atom (hyperfine interaction). 

(b) Treating the spin-spin interaction Hamiltonian in part (a) as a further perturba- 
tion term on the hydrogen-atom Hamiltonian, determine the matrix elements 
of this spin-spin term between states |n/jj,> of |! = 0 and / = 1. See how this 
correction compares with the fine-structure correction. 


GARTER X 


Indistinguishable Particles 


In this chapter the basic assumption about the combination of identical 
physical systems is conjectured from the indistinguishability of these systems. 


X.1 Introduction 


The quantum-mechanical systems that we have considered so far have 
consisted of only one constituent of the same kind. The hydrogen atom was 
considered as one electron in an electric field. The vibrating diatomic 
molecule was described as the problem of one oscillator. The rotating 
diatomic molecule was reduced to the problem of one system rotating around 
a center. And when the vibrating and rotating diatomic molecule was con- 
sidered, it was described as the combination of one rotator and one oscil- 
lator, i.e., as one rotating oscillator. These systems are called one-particle 
systems. Thus one-particle systems are systems that consist of only one 
constituent of the “same kind.” 

We now want to consider many-particle systems, 1.e., systems that are the 
combination of many (N = 2, 3, ...) one-particle systems of the same kind. 
Let #,, 4H ,..., An be the space of physical states of the first, second, ..., 
Nth one-particle system. (All of the spaces -# ; are identical; the subscript 
serves only to identify a space with a particular particle.) We would expect 
from the basic assumption IVa that the N-particle system, which is the 
combination of these N one-particle systems, will have as its space of states 
the direct product space 

§ =H, 0 H2@---B Hy. Cel) 


Uae) 
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The algebra of the N-particle systems would be the direct product of the 
algebras of observables of the one-particle systems, i.e., the algebra is the 
set of all operators 


A=)'A, @ A, @--: @ Ay, (1.2) 


where A, is an element of the algebra of observables in #;. (In particular, 
if these N one-particle systems do not interact with each other, then all 
elements of the algebra of observables in § are of the kind A = A; ®1 @ 
1®---@1+1@A, @1@::-@1l+::-+1@1@-:- OI @ Ay.) 

That the N-particle system would be described by (1.1) we would expect 
from classical considerations. Classical particles can, in principle, be num- 
bered; i.e., we can imagine that each particle is given at some instant a label, 
and we can then follow the subsequent motion of each particle in its path 
and identify a given particle at any subsequent instant. This is not possible, 
however, for quantum-mechanical systems. Following the motion of each 
particle of a quantum-mechanical system means performing a series of 
position measurements. Each measurement—according to the axiom III— 
changes the state of the quantum-mechanical system in an uncontrollable 
fashion: If we have localized the particle in the neighborhood of a certain 
point, then we do not known what its momentum is and we do not known 
where it will go. The concept of path does not exist for a quantum-mechanical 
system (uncertainty principle). Thus in quantum mechanics it is, even in 
principle, impossible to follow each of a number of identical particles and 
thereby distinguish them. (By identical particles we mean particles that have 
the same observable values.) Thus: Identical quantum-mechanical particles 
are indistinguishable. 

We shall now give a precise, mathematical formulation of indistinguish- 
ability and then formulate the consequences as another basic assumption 
of quantum mechanics. Let |€;>; denote a basis in the space .¥ ,, i.e., the one 
symbol ¢; stands for the full set of quantum numbers (eigenvalues of a c.s.c.0.) 
necessary to label the basis system of .#;. [For the sake of definiteness, one 
may assume that the N particles are electrons in a Coulomb field; then each 
H# ; is the:space.# of (1X.2.15), and [€)»= |nj l,j; j;3>.] The basisin § of 
(1.1) is then given by 


[o1o2°** wd = 1611 @ 1622 @--- @ |Enyn- (1.3) 


Suppose we are given N objects (elements) in a certain order, 
Coe 3 


such an arrangement is called a permutation. These N elements can be 
written in a different order, (7,.2.....y): this is called a permutation of 
the N objects. There are N! different permutations of N objects. One par- 
ticular permutation can be considered to be the “original” or “natural” or 
“standard” one. All other permutations can be obtained from this original 
one by changing the order in which the objects appear. It is clear that the 
operation of changing the order is specified by the resulting permutation; 
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therefore this operation is also called a permutation. Thus we may consider 
the permutation operation (or just “permutation”) P that changes 
(C1,62,---5€y) into (4;,2,...,my). For example, we can consider the 
permutation, P,, that changes (¢,,¢5,....¢éy) into (np="¢,,n. = ¢,, 
N3 = $3,---.Nn = Ey). Or we can consider the permutation P;, that changes 
eG :, ec oy) Into (C,, . segues, ...,.¢;, . -aaiey) by interchanging 
the ith and jth elements; such permutations, which consist of the exchange 
of the position of two elements, are called transpositions. Every permutation 
can be obtained by a finite number of transpositions; e.g., the permutation 
(€2,63, C1, C4,---, ny) may be obtained by the transposition P,; followed. 
by the transposition Py: (C1; G25 C3> Gas 2) 4 (63, C2> Se (Ie on ) a 
(€2, 3,61, 4,--.). While the decomposition of a permutation into suc- 
cessive transpositions is not unique, the number of transpositions will always 
be even or odd, depending upon the particular permutation. A permutation 
is odd (with respect to the original permutation) if it is obtained from the 
original permutation by an odd number of transpositions; it is even if it 
is obtained by an even number of transpositions. 

Now let the N objects be the N sets of quantum numbers (€,, €2,...) = 
(ny Lyjii13, M2!2j2j23,.--). Each permutation (4,, 42,..., Ny), or each permu- 
tation operation P:(€,,€,,..., Ey) (m1, 2,---,4n) can then be rep- 
resented in the space § by a linear operator P defined by 


Py Gisoc ond = |i M2 °°° tn (1.4) 
For example, the transposition P,, is represented by the operator P,, with 
Py21o102 --- €w> = [E201 --* On. (E>) 


The operators P will be chosen to be unitary.’ 


[The set {P} ofall permutations of N objects forms a group known as 
the symmetric (or permutation) group, while the set {P} of all rep- 
resenting operators forms a representation of the permutation 
group. If all operators P are unitary, then the representation is 
called a unitary representation of the permutation group. Because 
the permutation group has a finite number of elements (is finite 
and consequently compact), every representation of the permuta- 
tion group, according to a theorem, may be considered unitary. 
We shall make use of only one property of the permutation group, 
to be stated below, and shall not require any group theory. | 


Let A;;) denote the projection operator onto the one-dimensional subspace 
spanned by |é> = |€, €, --- €y>, and let Aj,, denote the projector onto the 
one-dimensional subspace spanned by |7> = |, "2 --- Nw>- Because of (1.4), 
we have the connection 


Amy = In><n| = PIE><E1PT = PAje PI. (1.6) 


! The reason for the requirement than P be unitary follows from the fact that P is a symmetry 
transformation. A brief justification of this will be given in Appendix to Section X1X.2. 
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Now, due to indistinguishability, \ ,, and A ,,, can represent a (pure) physical 
state only if 


Miny = Ale» (1.7) 
or, using (1.6), 


Thus of the one-dimensional spaces spanned by |€,> @ |€,> @ --- @ |én>, 
only those for which (1.8) is fulfilled can represent physical states. From this 
we conclude that, because of indistinguishability, it is not the whole direct- 
product space § of (1.1) but only a subspace of it that is the space of physical 
states. Also, it is not the whole direct-product algebra of operators given by 
(1.2) but only a subalgebra of it that is the algebra of observables. We now 
want to determine the physical subspace of (1.1). It is clear that, in general, 
the direct-product basis (1.3) is not a suitable basis [(1.8) means P|n)> o |), 
which cannot be fulfilled if all the quantum numbers are different ]; and that, 
in general, the physical states are represented by linear combinations of (1.3). 

Let w be a vector of the physical subspace [in general, a linear combination 
of (1.3)], and let P be a permutation operator. Then, if the particles are 
indistinguishable, y and y = Pw [or A,, and A, = A p,] represent the same 
physical state. The expectation value of every observable A must therefore 
be the same for w and for y, ie., 


WIAW> = IAL = CHI PTAP|y> (1.9) 


for every observable A. As w is an arbitrary vector of the physical subspace, 
we conclude that 


A= PtAP (1.10) 


for every observable A. Hence for any Ae. and for any permutation 
operator P, 


[P, A] = 0. (1.11) 


[Equation (1.11) follows immediately from (1.10) for P unitary, 
which, as mentioned above, can be assumed for the permutation 
group. For P nonunitary, one must take linear combinations 
Ww, + ia, wy + ay, in order to deduce (1.11).] 


Equation (1.11) is the mathematical formulation of the statement that 
identical particles are indistinguishable. From this mathematical formula- 
tion of indistinguishability, one can deduce that the vectors of the physical 
subspace of § must fulfill either 


Py =+wy forall P (1.12) 
or 


p is even if P is even, 


p is odd if P is odd. (1.13) 


Py =(—1)?’! where 
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The wy fulfilling (1.12) are called symmetric, while those fulfilling (1.13) are 
called antisymmetric. 

To deduce this, we need the mathematical formulation of a physically 
obvious property of the algebra oJ = {A} of observables A. This can be 
formulated in the following way:* ./ = {A} contains a complete set of 
commuting operators. To justify this condition physically, we recall that we 
called a state “pure” (up to a certain accuracy) if there is no observable 
whose measurement allowed the separation of an ensemble in this state into 
two or more subensembles. If such a separation is possible (as in the case of 
the hydrogen atoms in the ground state), it leads to the introduction of a new 
quantum number and therewith to the introduction of a new observable 
(in the case of the hydrogen atom, spin). Consequently one must enlarge the 
algebra of observables to accomodate this new observable. What is a pure 
state with respect to a certain accuracy need not be a pure state with respect 
to a higher accuracy. But up to every desired (and observable) accuracy, 
every pure state is completely specified by a set of quantum numbers that 
are connected with observables. (In fact, the algebra of observables is con- 
jectured from these quantum numbers and observables.) Each label of a 
vector is connected with an observable, and the observables whose eigen- 
values label the vectors form a complete set. [As we have seen, the statement 
that a certain set of commuting operators is a complete system is a physical 
statement, conjectured from the physical properties of the system, and not a 
mathematical statement (cf. Chapter IV)]. Therefore the apparent occurrence 
of pure states not fulfilling (1.12) or (1.13) is always an indication that the 
available set of quantum numbers is not complete and that there exist other 
quantum numbers completing this set that have not yet been uncovered. 

Let us denote by 4,, A>,..., A, a complete system of commuting opera- 
tors of the algebra of observables, by |a> = |a,a, --- a,) the corresponding 
eigenvectors, and by /\,, the projectors onto the subspaces spanned by 
|a>. Ajay is the observable whose expectation value gives the probability 
of obtaining a,,a,,...,4, in a measurement of A,,A,,...,A,. AS ob- 
servables, the A\,, must commute with all permutations P in accord with 


Celi): 
[P, Ajay] = 90 for all P. (1.14) 


From (1.14) it follows that |a> must be an eigenvector of all P. 


Proof : Equation (1.14) applied to a vector |w> gives PA, |Y> = 
A ayP|w>. Using Aja, = |a><al, we see that Pla><a|lW> = |a> 
<a|P|p>, and hence that 


— jay LAlP ue 
Play = la) Ty (1.15) 


2 It is this condition that is relaxed when parastatistics are allowed. 


242 Indistinguishable Particles 


The possible eigenvalues, <a|P|w>/<a|W>, follow from the following prop- 
erty (which we shall not derive here) of the permutation group: The permuta- 
tion group has two one-dimensional representations: (1) the symmetrical 
representation in which all permutations are represented by the unit oper- 
ator I, 1.e., 


P|y> = + |W) for all permutations P; (1.16) 


and (2) the antisymmetrical representation in which all even permutations 
are represented by the operator I-and all odd permutations are represented 
by the operator — IJ, i.e., 


p is even if P is even, 


IF 
p is odd if P is odd. oe 


Ply> = (-1)" |W where | 


We recall that an irreducible-representation space (ladder representation) 
of the algebra of {P} is a representation space that is obtained by applying 
all P to one element of the space. The fact that |a> is an eigenvector of all 
P means that |a> spans a one-dimensional irreducible-representation space. 
Therefore, according to the above property of the permutation group, we 
must have either 


Pla>=+|a> forallP (1.18) 
or 
Pla) =(—1)?|a> for all P. (1.19) 


Since this is true for every basis vector |a>, and as every vector |W) of the 
representation space of the algebra of observables (i.e., of the physical sub- 
space of §) can be written as a linear combination of the basis vectors, we 
conclude that either (1.12) or (1.13) must be fulfilled. 

Let us denote by #*that subspace of § which consists of the symmetric 
vectors, 


HY = {lW>e §: Pl p> = |W}, (1.20) 


and let us denote by #% that subspace of § which consists of the anti- 
symmetric vectors, 


H*™ = {p> eH: Pip = (-1)? |p}. (1.21) 


We can then formulate the statement, which we have derived from indis- 
tinguishability (1.11), in the following way: The space of physical states of 
N identical quantum-mechanical systems is either the antisymmetric space 
H™ or the symmetric space #% . 

A basis of nonnormalized vectors in #™ is given by 


Io>+ = 1182 -+- w+ = D Pld Sa--- En, (1.22) 
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where |¢,¢2 --- €y> is given by (1.3) and )’p is the sum over all permutations 
P of the N objects €,, €,,..., Ey. It is clear that the order of the quantum 
numbers €,,¢,.-..¢n in |€>, is irrelevant. To prove that the |é>, are 
symmetric, we calculate for an arbitrary permutation P, that 


Pilo>4 = 2 PiPleie2 Supe ea ez os typ 1G; 


where we have set P’ = P,P and where the sum over all P becomes a sum 
over all P’. The latter follows from the fact that if P, is fixed and P runs over 
all permutations, then P’ runs over all permutations. Thus for any permuta- 
tion P,, 


Pale 7oe = Gres (1.23) 


A basis of nonnormalized vectors in #™ is given by 
[o>_ = 1€1E2 --- Eyr- = D (— 1 PIE. 62 «> En. (1.24) 
Pp 


To prove that they are antisymmetric, we first note that(—1)’ = (—1)?*?! = 
(—1)’(—1)” and hence that (— 1)? = (—1)?(— 1)”. Then 


Pie ae AS al sles eres cn» 
= ¥ (—1)?(—1)P P2162 --: én =(—DPld>-. (1.25) 
ry 


for any permutation P,. If the |€;>; are normalized vectors in #;, then the 
|€,2--- €y> are normalized in §, ie., (¢1¢2 °+* Cwld1¢2°** Sud = 1. Con- 
sequently the |é>, and the |€>_ are not normalized ; the normalizing factors 
are calculated in Problem XI.1. 

We state the result of the preceeding considerations as a new basic as- 
sumption (axiom) of quantum mechanics: 


IVb The space of physical states of N identical quantum-mechanical 
systems (particles) is #\ if their angular momentum (spin) has an integral 
value, and is #™ if their angular momentum has a half-integral value. 


Particles whose space of physical states is # and #% are called bosons 
and fermions, respectively. Bosons are said to obey Bose statistics, while 
fermions are said to obey Fermi statistics. The above axiom IVb then states 
that half-integral-spin particles are fermions, that integral-spin particles are 
bosons, and that there are no other particles obeying some other “para- 
statistics” (which would have to belong to higher-dimensional representa- 
tions of the permutation group). This axiom has been confirmed in all cases 
where it has been investigated (electrons, protons, neutrons are fermions; 
pions, photons, phonons, alpha particles are bosons). We remark that the 
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axiom IVb has been derived, to a large extent, from indistinguishability 
(1.11), which in turn was deduced from previously formulated basic assump- 
tions of quantum mechanics. The part that has not been deduced is the con- 
nection between spin and statistics. 

The Pauli principle in its original form follows immediately from IVb. 
The Pauli exclusion principle states: The quantum numbers of two or more 
electrons can never entirely agree. 


CHAPTER XI 


Two-Electron Systems— 
The Helium Atom 


The system with two electrons is studied in this chapter. Section XI.1 shows 
that the space of physical states of the helium atom is the sum of the para- 
helium and the orthohelium spaces. In Section XI.2 the ionization thresholds 
(i.e., the energy values E,,,, at which one electron is in the nth level and the 
other is just dissociating from the atom) are determined, and the energy 
levels below the first ionization threshold are discussed. Section X1.3 discusses 
the energy levels above the first ionization threshold without considering 
the interaction between these levels and the energy continuum of the (He™, e) 
system. 


XI.1 The Two Antisymmetric Subspaces of the Helium Atom 


We shall illustrate the consequences of the basic assumption IVb with the 
example of two electrons in a Coulomb field. This is the simplest nontrivial 
case; although it does not demonstrate the full extent of IVb, it is mathe- 
matically simple and does not require the introduction of further properties 
of the representations of the permutation group. 

The energy of two classical spinless particles of mass m, with charge 
—e that move in the field of a central charge Ze is given by 
Die mmace Ze. 
on (py + p2) — a + a 
where r, and r, are the distances of the first and second charges —e from 
the charge Ze, and where r,, is the distance between these two charges. 
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E= 


(1.1) 
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This system is the classical analogue of the helium atom (or of any two- 
electron ion if Z # 2). We obtain the energy operator of the helium atom 
by the usual procedure of replacing the numbers p,;, X,;, 1, = (x2)'/*, and 
ry. = ((%, — x)*)'” by the operators P,;, Q.:,Q. = (Qz)"””, and Q,2 = 
((Q, — Q,)'’”, respectively. In addition we have to add a term H, that 
describes the influence of the electron spin. Thus we have 

WA 2 Wi, 2 2 

= 4 4H; (1.2) 
Q, Q Qi 

for the energy operator of the helium atom (Z = 2), which we write in the 
form 


| 
H = — (Pi + P}) - 
2m, 


H=H,.+ H,. (1.3a) 


Hg is the Hamiltonian operator corresponding to the classical Hamiltonian 
given by (1.1): 


] 


where 
Hoo = h, ® I + I (65) pe (1.3c) 
2 
h, = ss = a (az =m.e7?Z/h, h = 1), (1.3d) 
and 
ae 3 -1/2 
Q12 i—=1{ 
The operator H and all the other operators act in the space 
SS Cit. (1.4) 


where #, is the space of the system that consists of one electron in the 
Coulomb field of the charge Ze. 

For two objects ¢, and ¢, there are only 2! = 2 permutations, (€,, €,) 
and (¢,, ¢,); therefore for a fixed set of quantum numbers €, and €, with 
€: # ¢2 there are only two basis vectors in §:|&,&€,> and |&,¢,>. The 
normalized symmetric and antisymmetric vectors for this fixed set of quan- 
tum numbers are, according to (X.1.22) and (X.1.24), 


1 
Sicepee ~ es ete) (1.5) 
and 
1 
Ker — a = [o2¢1>). (1.6) 


Thus for a fixed set of quantum numbers €, and €, with €, # €, we havea 
two-dimensional space spanned alternatively by |€,€,>,|€,¢,> or by 
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Ilo1o2>4,516:E)>_. If €, and &, are fixed with €, = €,, we have a one- 
dimensional space spanned by |é,¢,) = |€,€),. The space § is spanned 
by |¢,¢,>, where €, and €, independently can take any of the possible sets 
of values (nij,) (1 = 1, 2,...,f=0,1,....n-—1; j, = —j, —j+1,..., 
j;1=j + 4).* The space # is spanned by all the vectors |é,é,)_, and the 
space #7, is spanned by all the vectors |¢,¢2>4. Consequently 


Se F720 #2, (1.7) 


1.¢., the product space is the direct sum of the symmetric and antisymmetric 
subspaces. 


[Equation (1.7) is a particular feature of the case N = 2; for N > 2, 
§ of (X.1.1) is not the direct sum of the symmetric and antisymmetric 
subspaces (X.1.20) and (X.1.21). Rather 


H= HT OHS OX'@OLNO--, 


where there are a finite number of terms, as many as the ways in 
which one can write N as a sum J)’; N; of positive integers N;. For 
N = 2 there is only one way possible (2 = 1 + 1), and consequently 
there are only two different terms, as given by (1.7).] 


According to the axiom IVb, or the Pauli principle, only the subspace 
H* of (1.7) is the space of physical states for the two-electron system. (Were 
we considering a two-boson system, #7 would be the space of physical 
states.) To construct #2 and to find the properties of the algebra of ob- 
servables in #2 , we proceed in the following way: Each #, (a = 1, 2) is, 
according to (I[X.2.15), written as 


M027 On, (1.8) 


where .#°"” is the space in which the orbital observables (i.e., the observables 
that are obtained as functions of the P,; and Q,,;) of the «th electron act, and 
where 23 is the space in which the spin observables of the ath electron act. 
We now combine the orbital and spin spaces of the two electrons separately, 
1.e., we form 


aoe = Pate @ gee (1.9) 
and 
= 8 @as. (1.10) 


We then find the symmetric and antisymmetric subspaces of #°% and *°” 
separately by the same procedure as described above; the ¢, in (1.5) and 
(1.6) stand for €°> = (n,1,1,,) when considering #°™ and stand for & = s,, 
when considering 2°’. In this way we arrive at 


your aa Cane @ yeur? ate 1) 


1¢ > + €, then means not all (n,!,j;j13) agree with (21, jz j23). 
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and 
S =e Ore. (1.12) 
The total space § is then given by? 
§ = #° @ 
= (HE @ 0%) @ (HF @ 2) © (HZ @ 2%) © (HA @ 2®). (1.13) 
In Problem XI.2 it is shown that the symmetric subspace is 
H?, = (HEB 0%) © (HN @ 2%) (1.14) 
while the antisymmetric subspace is shown to be 
He = (HE Oe) @ (HE Oe). (1.15) 


Thus the space of physical states #? is the direct sum of two spaces: one is 
the space of symmetric orbital states and antisymmetric spin states, and the 
other is the space of antisymmetric orbital states and symmetric spin states. 

As has already been discussed (Section IX.2) in the case of the hydrogen 
atom, the basis vectors that are eigenvectors of the (total) spin and of the 
(total) orbital angular momentum are not a physical basis, because it is the 
(total) angular momentum that is the physical observable, and not the spin 
or orbital angular momentum. Thus to obtain the physical states one has 
to form those linear combinations of the direct product states | Eo"°é3">> , @ 
ISisS23>- (in HY @ 2%) and |EXPES)_ @ [5135234 (in HN’ @ rk 
that are eigenstates of the total angular momentum. Furthermore, it will 
turn out that the spaces #3!’ @ 2° and #° ® 2° are not eigenspaces of 
the energy operator H of (1.3a); the reason is that H, does not commute with 
the operator of total spin S; = S,; + S,;, but 2 and *° are eigenspaces of 
S?. If the physical states are eigenstates of H (as all experimental data con- 
firm) then the physical state vectors are elements of neither #°"* ® «°° nor 
KH ® +, but are linear combinations with a small component in one of 
the spaces and a large component in the other. Thus the reduction of #7 
into the direct sum given by (1.15) is only approximately physical; the 
subspaces #7" ® o and #°* ®@ »% are spaces of physical states only to 
the extent that the contribution of H, (the spin-orbit interaction) to H can 
be neglected. As in the case of the hydrogen atom, this will turn out to be a 
very good approximation. 

We shall now neglect H, and undertake a detailed construction of all 
four spaces on the right-hand side of (1.15). We start with the spin spaces 
7, and 2, as these are much simpler than the orbital spaces. The space 


e = 25 @ 4 is the direct product of 2 two-dimensional spaces and is thus 


* We wish to stress again that the appearance of only symmetric or antisymmetric subspaces 
is a particular feature of N = 2; for N > 2, higher-dimensional representations a, of the per- 
mutation group in #°'>™ and in ** also have to be considered. The antisymmetric space  % 
then contains not only the spaces #°" ® >‘ as in (1.15), but also contains all direct-product 
spaces of the form #°'*™ @ 2S", where o’ is the irreducible representation of the permutation 
group that is “associated” with the irreducible representation ¢ in such a way that #2" @ 7° 
is an antisymmetric subspace of §. 
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four-dimensional. Its direct-product basis is given by the four vectors 
IS13>1 @ |23>2 (Si3 = +3, S23 = +4). (1.16) 


One easily finds the antisymmetric and symmetric combinations of these 
four vectors. The symmetric ones are 


I>: @ld2, 1-1 @1-Ho, 
1 
and piles @®|—d2 +1-2>1 @13)2), (1.17) 


while the single antisymmetric vector is 


i 
Fld @1-Da — 1-1 BHD). (1.18) 


The four vectors of (1.17) and (1.18) are orthonormal and therefore con- 
stitute a basis in 2°’; consequently the three symmetric vectors span the sym- 
metric space #., which is therefore three-dimensional, and the vector (1.18) 
spans the antisymmetric space +’, which is therefore one-dimensional. 
e = R="? @ R2=1/2 is the space of the combination of two elementary 
rotators. We can thus apply the results of Section V.2 and define the operator 
of total spin, 


§,;=5,,@1+1@S,; (i = 1, 2, 3). (1.19) 
Equation (V.2.33) then tells us that 
o = B® Q~°, (1.20) 


ie., the total spin is s = 1 or s = 0, and we can introduce in 2 the basis 
|s s3> with 


S7|5 53> = s(s + ae can SS) (2) 


S3|ss3)> = )s3|s 53> s=1, s,; = —1,0,1. 


The observables S$; commute with the permutation operators (in this case 
the transposition operator P, ,), since 


P28; — Pi (Sj; @I+1@ S>i) = (I @ So; aT Si; 3) IP ;, = S;P 12. 
(1.22) 


The two subspaces +5, and »* are eigenspaces of P,, corresponding to the 
eigenvalues + 1 and —1, respectively. Because of (1.22), S; cannot transform 
out of either 2% or 2°’, ie., S; leaves 2 and 2* invariant. Thus S; leaves in- 
variant #=! and #~° on the one hand, and leaves invariant 7 and 2° 
on the other hand. Also, #*~' is three-dimensional, as is 2, and #°*° is 
one-dimensional, as is 2; consequently #°~' = 78 and #~° = °°. We 
can therefore write (1.15) as 


HH? = hem ® Ba) @ (le 2) R= 1), (1.23) 
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The space of physical states (neglecting the spin-orbit interaction) is thus 
the direct sum of a space in which the total spin is zero (space of singlet 
states) and a space in which the total spin is one (space of triplet states). 
This, as we shall discuss below, is the explanation for para- and orthohelium, 
first given by Heisenberg in 1926. 


XI.2 Discrete Energy Levels of Helium 


We will now investigate the structure of the orbital spaces in (1.11). In each 
KH” (a = 1, 2) we have a reducible representation of the algebra &(SO(4)) 
of the orbital angular momentum 


Lyi = Eijx Qaj Pax GA) 
and the Lenz vector 
azQ.,; 
Ag (2h ye (sent Pa, ee 22) 
( ) ( xh 1 aS 0, 3) 
which, according to (IX.2.13), is given by 
H" = © An). (2.3) 
n=1 
In the direct-product space 
wor’ = ae ® eu (2.4) 


we have a representation of the algebra of orbital angular momentum and 
the Lenz vector given by 


L=L,@1+1®@L,,, 


The operators of (2.5) are defined in analogy to the definition (V.2.3) of the 
total angular momentum of the combined system of two elementary rotators. 
It is easy to see that L; and A; obey the same commutation relations as L,, 
and A,,, i.e., the commutation relations of &(SO(4)): 


[L;, L;] = 16; je Ly, [L;, A;] = 16;;, Ay, [A;, A;] = ig;,L,. (2.6) 
We can define the operator 
C,=C,,@1+1@C),, (2 
where, similarly to (VI.3.13) and (VI.3.17), 


(2.5) 


Ze 2 
C4 SAP Ee Sth) hg, = 


(2.8) 


We can further define the operator 


oF = A;L; (2.9) 
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in analogy to (VI.3.14) for the hydrogen atom. The operators C, and C 2 
commute with A; and L;. However, these operators do not fulfill the same 
relations as the operators C,, and C,,. In particular, C, is not related to 
the energy operator Ho by as simple a relation as (VI.3.17), and C, is no 
longer zero. To see the latter, insert (2.5) into (2.9); one then calculates that 


C, = 1,;A,;,;@ 1+ 1@ L,;A2; + Ay; ® Ly; + Ly; ®@ Ar, 
which gives [making use of (VI.3.18)] 
ce => Aj; ® L,; + Ly; ®& A3;- (2.10) 


This is not, in general, identically zero. 
The energy operator of the helium atom in the approximation in which 
the influence of the spin is neglected is given by (1.3b): 


1 
Ho = — Hoo + W. (2.11) 


It is customary to consider first the term (1.3c): 
Hog = h, ©I +1 Qh. (2.12) 


Hoo is the energy operator for a system of two noninteracting electrons in 
the (nuclear) Coulomb field. As it is very unrealistic to neglect the Coulomb 
interaction between the two electrons (which is of the same “strength” as 
the Coulomb interaction between each electron and the nucleus), Hog is a 
very poor approximation to the energy operator of the helium atom. Thus 
we cannot expect that the spectrum of Ho, will give a good approximation 
of the energy spectrum of the helium atom. As we will see later, it happens 
that the qualitative features of Hoo agree with those of the energy operator 
Hy = (1/m,)Hoo + W;; this justifies the usual treatment of first considering 
Hoo separately. 
Using (2.8) and (2.12), Ho, can be written 


az 

2 oT fay = ey, 

The spectrum of Hoo is easily found. We introduce into #°" = #3" ® 
H;” the direct product basis 

li,>, ln ti >>. (2.14) 


As, according to (1.23), we want to known the spectrum of Hoo in the sym- 
metric subspace #2 and in the antisymmetric subspace .#°’, we in- 
troduce in #°" = #2" © # the basis system of symmetric and anti- 
symmetric vectors 


Hoo = — @1+1@—— (2.13) 


1 fo) bat 
~ guages 3>2 + |n'l'l5>, @ [n113>2). (2.15) 
Equation (2.15,) gives the basis system in Ho while (2.15_) gives the 
basis system in °°’, The vectors (2.15,) are eigenvectors of Ho and 
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together form a complete basis system in KH’. The spectrum of Hop is 
therefore obtained by applying (2.13) to (2.15+); the result is 


H , wel 1 
spectrum sel =1Hy, @=— Rae (os +) (2.16) 
where 
a az= 2 =f m,e* = 4R" = 54.4 eV. (2.17) 
ph as) 2h? 


“1¢ differs from the Rydberg constant for the hydrogen atom by a factor of 4. 
It is the Rydberg constant for a one-electron system in the Coulomb field 
of a charge Ze = 2e, i.e., for the He* ion. Its value in cm~ ' (wave-number 
units or inverse wavelength units) is 

1 54.4 eV 


= —— Hf = —— on = 4,39 10° . 
Ruts Qeee © 10 10 eVien a 


The basis vectors (2.15+) are not eigenvectors of the total orbital angular 
momentum L? and L,, and are not eigenvectors of the energy operator Hy 
(recall that Hy is the energy operator if the contribution of the spin H, is 
neglected). If the physical states are eigenstates of the energy operator H, 
they are very closely eigenstates of H). Eigenstates of Hy can be eigenstates 
of the total orbital angular momentum L? and L;, but can be neither the 
direct-product states (2.14) of angular momentum ! and /' nor the particular 
linear combinations (2.15+) of those direct product states. [The states 
(2.15+) are eigenstates of L? + L3. But 


[L? + L3, 0%] = —2[L4,L2,, (Q1; — Q2)(Q1; — 22] 
= —41€,;(L4,Q2;91;1 — Q1:02;L) # 0. 


Consequently the vectors (2.15) are eigenvectors of an operator that does 
not commute with Hy. f One therefore has to couple the angular momenta | 
and I’ in (2.15 +), according to the rules of Section V.2, to obtain eigenvectors 
of the total orbital angular momentum. These eigenvectors, which are formed 
as linear combinations of (2.15+), are not yet Hy eigenstates but still Hoo 
eigenstates; Hy eigenstates can then be formed as linear combinations with 
the same value of total orbital angular momentum. 

Instead of approximating the helium atom by a model system that con- 
sists of two noninteracting electrons that move in the Coulomb field of a 
doubly charged nucleus, one can try to approximate it by a model system 
that consists of one electron moving in the electric field that is formed by the 
doubly charged nucleus and the other electron. This model is certainly much 
more realistic if the one electron is ~in the classical picture—far away from 
the nucleus and from the other electron, which are close together. To obtain 
this approximation we write 


I 1 
Hy = A. h, + HS os an h, SF ale (2.18) 


€ € 
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where 


(2.19) 


More precisely, 


2 


1 e 
Hi Oo 
m, ' Q12 
and 
1 2 
eee eee 
Mm, 12) 


Equation (2.18) is exact, but H%' is not an operator in #°"; it would be an 
operator in #°"° if we made the replacement 


Oi, > OF (2.20) 


In our classical picture this would mean that the electron that is close to the 
nucleus is really at the position of the nucleus. With the asymptotic re- 
placement (2.20), the energy operator for the electron far away from the 
nucleus has the asymptotic form 


eevee! | a tee Ot, (2.21 
2m, Q, Q1> 2m, On 7 Mm, petty ) 


ie., the energy operator for the distant electron is the same as the energy 
operator of the electron in the hydrogen atom. In the approximation (2.21) 
the energy operator H, is approximated by 


Pe 1 1 1 1 
Be eh ay hy + a Ay yyar = ae eae 3 Ay wyar> (22) 


€ € 


el _. 
Hy = 


where H, pyar is the hydrogen-atom Hamiltonian corresponding to (VI.3.5). 
Letting n’ > n, the spectrum of Hp is then 


~ ~ 1 1 
spectrum Ho = Ean’ aa aa Rye n2 =e n’2 


4 1 44 
ak. (= + =) > ERY = —R" i 2 7) (2.23) 
n n n n 


A more realistic approximation for Hy of the form (2.18) is to make the 
replacement 


pe Zee 
[PRs ee > i) : (2.24) 


where Z). is a number between one and two that expresses the screening of 
the nuclear Coulomb field by the nearby electron. A possible approximation 
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for H, is thus 
Ho 4 AoZere) = Hizecs) ag HSiZeee) (2.25) 


The value of Z. should be different for different states of the helium atom. 
For the ground state one would expect Z{}/ = Z{{}; for the state with one 
electron in the ground state and the other in a very high state one would 
expect, according to (2.22), Z{}) ~ 2 and Z{{} = 1. Thus for different sub- 
spaces we have different operators Ho,z.,,)- 

Let us now consider the subspace of states of the helium atom that can be 
characterized—in the classical picture—in the following way: One of the 
electrons has just dissociated itself from the helium atom, 1e., the system 
consists of a He* ion together with an electron of zero relative energy. Let 
us call the subspace of these states #,,; on this subspace Hy can be ap- 
proximated very well by 


I 1 
Hy =—h, + Hf =— i 2.2 
=] hi Ha = hat Wi (2.26) 


e 


where H¢! is approximated by (2.21). The subspace #,, is then defined as 
that subspace of #°"* on which 


ip —= 10) Q27) 
Let us denote the eigenvectors of H¢' in #°° by 
Wadia. (2.28) 


in distinction to the eigenvectors |n/1,>, of h,. These eigenvectors have the 
property [under the approximation (2.21)] 


1 
Hi (nls). = —R" —y|n 113}, (2.29) 


and are eigenstates of 12 and L,,. The eigenvectors on which (2.27) is ful- 
filled are denoted by 


|00 FI5}a- (2.30) 
The direct-product basis in # ,, is therefore given by 
[ol Ts}; @|nlls>2, — |nI1s>1 @ [ool }>. (2.31) 


We introduce in .#,, the basis of symmetric and antisymmetric vectors 
I 
We 


On #, the energy operator (2.26) has, according to (2.27), the following 
spectrum: 


(Joo P13}, @ |nlls>2 + |n1l3> @ |oo V13},). (2.324) 


4 
I 
Eno = Spectrum Hol”. = ——5— —+ = —Rie se (2.33) 
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For Ho of (2.26) applied to (2.32 +) gives 


h 
(a 13}, @ = |nlls>, + Holy}, @ |n1 ls), 
mM, — — 
==(() 


1 
Ra 


h 
+ + InI1,>, @ |ol’ Ih}, + Inl1,>, ® H$}| 00 I’ 3] 
me —_———__ 
=(() 


= — Spar zglorh 331 © Inll,>, + (nll, @ lal i},). 


The E,,,. are the energy values of the helium atom when one of the electrons 
has zero energy and are also the energy values at which one of the electrons 
dissociates from the atom (ionization thresholds). We expect that these are 
the highest energy values for a given quantum number 2, i.e., for states in 
which one of the electrons has the principal quantum number n, because if the 
other electron has the principle quantum number rn’, its relative energy with 
the He” ion will be negative. E,,,. is called the nth ionization threshold; it is the 
value at which one of the electrons is just dissociating itself from the atom 
while the other is in the nth energy level. 

Comparing (2.33) with the spectrum of Hoo given by (2.16), we observe 
that 


Ene + E,.. from below asn' => oo. (2.34) 
Comparing it with (2.23), we observe 
Eww — Eno from below asn’ > oo. (2235) 


From our above considerations we expect the eigenvalue E,,,, of Ho to lie 
between the eigenvalues of Hp, and Hy: 


Phe ee FO (2.36) 


a 


In the energy diagram of helium we first draw the energy levels E,,,, 
(n = 1, 2, 3,...). The energy levels E,,,, lie, according to (2.34), (2.35), and 
(2.36), below E,,,,, and lie closer and closer together as n’ becomes higher. 
The larger the values of n’, the closer are the eigenvalues of Ho, to the 
energy values of the helium atom. Consequently for large n’ and a given n, 
the space spanned by the vectors (2.15+) and (2.15—) represents to a good 
approximation the space of physical states. In the classical picture of the 
helium atom, increasing values of n’ mean that the “second” electron is 
further away from the nucleus and from the “first” electron, so that the in- 
fluence of the interaction term e7/r, , issmall; thus the classical picture supports 
the result described above. For decreasing values of n’ the interaction term 
e”/r, becomes more important, and we expect larger deviations of the energy 
value of the helium atom from the value E®°. We expect the largest deviation 
for the states in which both electrons have the principal quantum number 
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n =n’ = 1. In this case the interaction term is expected to be largest (both 
electrons are closest to the nucleus and to each other), and E99 is a very poor 
approximation to the eigenvalue of Hy. In this case, therefore, the space 
spanned by (2.15+) with n = n’ = 1 is far from being the space of (orbital) 
physical states. Thus for low values of n and n’ the helium atom cannot be 
considered as consisting of two independent electrons (independent-particle 
approximation) in a Coulomb field. The physical states are not states in 
which one electron is in one particular state and the other electron is in 
another particular state; rather it is only the helium atom considered as a 
whole that is in the physical states. 

We now consider the space of states (which we call 4,) of the helium atom 
that have energy values below the value E,,,. The space of eigenstates of H, 
with eigenvalue E,,, describes the state of the helium atom in which one 
electron is in the lowest state (n’ = 1, I’ = 0, I; = 0) and the other electron 
is just dissociating from the helium atom. The eigenvectors of Ho, in this 
subspace 4%, can be obtained from the independent-particle states (2.15*): 


(In’ = 1F = 08, = 0), @|nll;>, 


1 
Wns fad j2 


+intl,>, @|n =i =O) = 0>,): (2:37) 


i 
Patty = oe = 1 =015, =0), @|nll,;>, 
—|nll,>,@|n' =10 =04%, =0),). (2.38) 


The Wnt span the space of symmetric states of 4,, which we call 4, ,: and 
the y?),, span the spac of antisymmetric states ‘i #,, which eo call 4, 
That is, y?,, and y®,,, are eigenstates of Ho, with teat Eee 


nn’=1- 


1 
AyoWnn, = — Rite (i aE =) iss (2.39) 


1 
Hoo Yrut, = —R. ( Bia =) 2.40) 


As I’ = Oand /; = 0, Wo, and Yeu, are also eigenvectors of L? and L,; with 
eigenvalues ((/ + 1) and /,, respectively. Let us denote by Wau, and y,1,, the 
eigenvectors of Hy, L?, and L, in 4,, and in 4, , respectively. The eigen- 
vectors Wyn, and yn, a H, can be obtained fom te eigenvectors Wr),, and 
Yau, Of Hoo by acting on hen with a (unitary) operator that does not change 
the eigenvalues of L?, L,, and P,,: 


Watts a U b (nln, e] (2.41) 
Ynits = U 7 (nl)yen, bg (2.42) 


This transformation U*(n!) may be different for different values of n and i 
U*(nl) does not depend upon /;, as LH), L;] = 0. Equations (2.41) and 
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Figure 2.1 Splitting of the eigenvalues of Ho. under the influence of W. 


(2.42) show that the eigenvectors of Hy can still be characterized by the 
principal quantum number n, though they might be quite different from the 
eigenvectors of Ho, with eigenvalues Ep’, = —Ry,.(1 + 1/n?). For large 
values of n, U*(nl) is very close to the unit operator. The eigenvalues of Hy 
in £, we denote by E,(J) and E,,(/); in general they will depend upon /: 


AoWnn, = EmOWats> (2.43) 
Ao ynu, = En (yan: (2.44) 


Weare not that much interested in the calculation of the exact values of E4()); 
as we want only to obtain a qualitative understanding of the energy spectrum 
of the helium atom. Therefore it is not necessary to know the operators 
U *(nl). (They are connected with the interaction term W and can be calculated 
by approximation methods.) 

The important conclusion for the qualitative understanding of Ws energy 
spectrum that we caaat from (2.41) and (2.42) is that to each n?-fold de- 
generate eigenvalue E?? of Hog on 4, , there correspond the n energy values 

E,,(), the interaction inn W splits E?? into nsublevels as shown in Figure 2.1. 
And to each n?-fold degenerate eigenvalue E°? of Ho, on 4, - there correspond 
the n energy values E,,,(/). As the result of our above consideration we obtain 
the following energy spectrum of the helium atom below E, , (Figure 2.2 and 
2.3): The lowest energy value is E;,(! = 0) in the symmetric subspace #°""”, 
which is connected with E?? = —Rj,(1 + 1). There is no corresponding 
energy value in the aasoiuuanidid subspace #°"*, because Yiu, = 0. The 
values of E,, and E°° as calculated from (2.23) and (2.16) are 


E,, = —68.05eV and E99 = —108.8 eV. 


The experimentally measured value of E/,(/ = 0), the double ionization 
energy of helium (minimum energy required to free both electrons), is: 


Ee 0) = — 79.0 eV, 


which shows that Hoo is really a very poor approximation to the energy 
operator H,. The energy difference 


E,. — E*,(0) = —54.4eV + 79.0 eV = 24.6 eV 
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Figure 2.2 Energy levels of the Helium atom below the lowest 
ionization threshold. 


is the energy that is necessary to dissociate one electron from the helium 
atom when it ts in the ground state; it is called the “ionization energy” or 

5 eS : “ye 
“ionization potential.” For n = 2 there are energy values Ej,(/) in wo” 
and E;,(1) in #°°°. They correspond to the eigenvalues 

EYS = —Ri(1 + 4) = —68.0eV 

of Ho, and 

E,, = —RR.-1—R”. $= —578eV 


of H. The experimentally measured values are 
E,,(0) — E;,(0) = 19.8eV or E;,(0) = —59.2 eV, 
12(0) — E7,(0) = 20.6eV or Ef,(0) = —58.4eV 
for 1 = 0, and 


E;,(1) — E¥,(0) = 209eV or E;,(1) = —58.1 eV, 
E},(1) — Ej,(0) = 212eV. or Et,(1) = —57.8eV 


for | = 1. Thus for n = 2 the energy values are already closer to the eigen- 
values of Hoo, as we expect from our general discussion above. For n = 
3, 4, 5, ... the agreement improves further. 

Figure 2.3 gives all known discrete energy levels of the helium atom. This 
energy spectrum is in agreement with our discussion above. To summarize 
our findings for the energy spectrum below E, ,,, there is one distorted, shifted, 


Singlet states 
“parahelium” #°r 


Ey, (l = 0) 


Figure 2.3 Term diagram for energy levels of helium. (The part below 
the first ionization threshold is again given in Fig. 2.2 which is the term 
diagram of the Helium that one usually finds.) The energy levels above 
the first ionization threshold E, 99 lie in the continuous spectrum. 
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and split hydrogenlike spectrum of states in the space Hand there is 
one distorted, shifted, and split hydrogenlike spectrum—without the lowest 
level—of states in the space #°"’. According to (1.23) the states in #7" 
are states with total spin zero, called singlet states; the states in #""° are 
states with total spin one, called triplet states. As we shall explain below, 
there are practically no transitions between the singlet and triplet states of 
the helium atom. (The 2°S state is a quasistable state with a very long lifetime). 
Therefore the splitting of the energy diagram into two parts had in earlier 
days lead to the hypothesis that helium was a mixture of two elements, 
“parahelium,” described by #2" @ #=°, and “orthohelium,” described 
by 4 oe: 

For parahelium the total angular momentum is equal to the total orbital 
angular momentum L= / because the total spin S = 0. Therefore the in- 
teraction caused by the spin H, cannot split the energy level Ej,(/), but can 
only cause a small shift of the same order of magnitude as the fine structure 
in the hydrogen atom, which has been calculated by approximation methods 
(singlet). 

For orthohelium the total spin S = 1. Therefore the direct-product 
vectors 


Yn, © |s = 153) (2.45) 


in %,_ © #' are not eigenvectors of the total-angular-momentum op- 
erators J* and J;, where 


J, =L,@1+1@S,. (2.46) 


To obtain eigenstates of the total angular momentum we have to couple 
the orbital angular momentum and the spin in 4,_ © #°~' according to 
the rules of Section V.2. Let .@' denote the subspace of 4,. < #° with 
orbital angular momentum /; it is the space spanned by y,,,,, with a fixed value 
of |. The eigenspaces .# of total angular momentum j are then given by 


MO! @ MI" @ MF"! GFL #0, 


Mim1 ifl = 0. ae 


M' ® R= 3 = 
So we see that for orthohelium, except for the S states, all orbital-angular- 
momentum spaces are actually a triplet of total-angular-momentum spaces. 
Therefore the spin perturbation H, of the Hamiltonian H = H) + H, 
splits the P(/ = 1), D (1 = 2), and F (I = 3) energy levels into triplets of 
fine-structure levels, each belonging to an eigenspace of total angular 
momentum j = /1+ 1,11 —1. 

These results agree with the experimental energy spectrum: parahelium 
consists only of singlet terms, whereas the orthohelium consists (except for 
| = 0) only of triplet terms (these are so close to each other that they are 
shown by one line in Figures 2.2 and 2.3). To avoid labeling the states by their 
symmetry properties under permutation of the orbital part (ortho- and para-), 
one also calls the S term of the orthohelium a “triplet” terms, *S,, though it 
consists in fact of only a singlet. 
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XI.3 Selection Rules and Singlet—Triplet Mixing for the Helium 
Atom 
The operator of the dipole moment for the helium atom is 


d= eQ, @/+1@Q,), (3.1) 


where d is a vector operator with respect to the total angular momentum 
operator 


J,=J,,@1+1@J,,=L,@1+1@S, (3.2) 
and also with respect to the total orbital angular momentum 
L,=1,,;@7+] @L),;. O35) 
Thus it obeys the commutation relations 
LJ;, dj] = i€:yd, (3.3) 
LL;, dj] = i€;dy (3.3’) 


As a consequence, one obtains by (V.3.7) for dipole radiation the selection 
rules 
JoJ-—1, Ale J+1 (3.4) 


LoL—1, L, L+1 G5) 


The physical states are total angular momentum eigenstates, therefore the 
selection rules (3.4) hold strictly for physical states. The selection rule (3.5’) 
holds to a very good approximation, precisely to the extent to which H, 
commutes with L?. 

The parity operator for the two-electron system is the direct product of the 
parity operators in the one-particle subspaces: 


UG =U. Uae (3.6) 
Therefore, for the vectors (2.41) and (2.42), one obtains 
U, Waits = (- 1)'No Watt UY ntts = (- 1) nnn, (3.7) 
because L = 1 below the first ionization threshold. As (3.1) is a proper vector 
U,dU,' = -d, 


it follows by the arguments of Section V.4 that the transition L —> L is 
forbidden, so that with (3.5’) one has the selection rule 


Lo-L+1, L —1. (335) 
A further selection rule follows from the fact that 
[d, Piz] = 0 (3.8) 


For every vector 
eae 4 (3.9) 
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and 
pe HI” @ae (3.10) 
it follows as a consequence of (3.8) that 
dy = ye’ @r (3.11) 
dy = We HH?” @ (3:12) 
Therefore, 
(W, dy) = 0 (3.13) 


Consequently, there are no transitions between singlet and triplet states (the 
same arguments as above hold for the quadrupole and higher multipole 
transitions, too). As already mentioned in Section 2, this had led to the hypo- 
thesis that helium is a mixture of two elements—parahelium, possessing only 
singlet states with an energy diagram given by the left part of Figure 2.2, and 
orthohelium, possessing only triplet states with an energy diagram given by 
the right part of Figure 2.2. However, one then observed transitions of very 
weak intensity between energy levels of para- and orthohelium. These inter- 
multiplet transitions are due to “singlet-triplet” mixing”* caused by H,. We 
shall describe the eigenstates of total angular momentum in the spaces 
HO”? @ B-° HH @ F*=!, which are obtained by coupling the spin and 
orbital angular momentum, according to the rules of Chapter V. For 
Hor’ @ A? we obtain: 


wey = Watts =J, @ diI83>4 |s3>2<385, 383 |00> (3.14) 
where W,1, 8 given by (2.41) with (2.37) and the second factor on the right- 
hand side of (3.14) is the same vector as the one given by (1.18). 

For #°* @ #=' the total angular momentum eigenstates are given by 


Yat? = DY ntts ® (Zs [253 >2<485, 385 | i>] CU 1s3|JJ3>, (3.15) 
1383 $455 
where y,,,, is given by (2.42) with (2.38). Fora fixed n and J, 7)? is a basis in the 
spaces .@’ on the right-hand side of (2.47). 
According to the considerations in Chapter VIII on perturbation theory, 
the eigenstates of H = Hy + H, are in first order given by 


ey yee JJ JJ JJ 
ni = Wat? + ), (eee = Fae Win’ Pl Hy | Wai ?> 
nl’ Ei 

no#n 
[hee ai: 


i 
ip a Mirae | Ay War) (3.16) 


* Mixing is the common term used for this kind of phenomenon, which is a somewhat 
misleading nomenclature as compared with the meaning of the word mixture of states in quan- 
tum mechanics. The singlet-triplet mixed states are not a mixture of states but a linear combina- 
tion of states. 
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and by 


1 
Te at a), (= Ba, Yee at A ai) 


nl’ 
’ 


1 
+ py ag Re it), 3.17) 


where E,, = E,\(1) and E,, = E,,()) are the eigenvalues (2.43), (2.44) of Hy 
in the singlet and triplet states, respectively. 

In (3.16) and (3.17) we have already taken into account that [H, J;] = 
[H,, J;] = 0. From parity conservation, 


[H,U,]=0 (3.18) 
and, consequently, [H,, U,] = 0 then follows according to (3.7): 
(Uni |Hy|Yai?> = 0 unless (—1)" = (- 1), 


3.19 
Cyt |H, ly) = 0 unless (—1)" = (—1), ee) 


and 
Wi? lH ly2?> =0 unless (—1)"4o = (—1)'ny. (3.20) 


In the perturbation series of (3.16), (3.17), only states of 4,, or 4, with 
energy below the first ionization threshold have been included. This should 
suffice for the qualitative considerations that we wish to present here.* 

The matrix elements <y24?|H,|yZ/3> and (wi#}|H,|W23) give the fine 
structure contributions, i.e., the splitting of the terms *(L), into three energy 
levels and a small shift of the energy levels of '(L),. Calculations show (Bethe 
and Salpeter, 1957, section 40) that the J = 1 andJ = 2 levels happen to have 
almost the same energy. 

The matrix elements <w24?| H, |yZ7*> give the singlet-triplet mixing, which 
we shall discuss now. 

H,, like every observable commutes with the permutation operator P, 2, 
which represents the transposition of the two electrons. However, if we write 
P, 2 asa direct product of an orbital part P{’? and a spin part Pj, according to 
the representation (1.23) 


Py. = Pi? @Pi2, (3.21) 
then 
[H,,P97] 40, [Ay Pi.] #0, (3.22) 


i.e.,H, contains contributions that are not symmetrical in the orbital operators 
and spin operators separately. An example of such a contribution is the 


4 For more details, see Bethe and Salpeter (1957), in particular Chapter IIb. 
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interaction between the spin magnetic moment and orbital magnetic moment 
of each electron which would, according to (XI.3.23), be given by 


H* = C(Qq))Say- Lay + C(Q2))S2y- Ley (3.23) 
with 
e? 1 
Co = CQe) = 5 (act) or (3.24) 


H' is only one term in H , : other terms like the relativistic mass correction term 
and the interaction between the spin magnetic moment of one electron and the 
orbital magnetic moment of the other are of the same order of magnitude. As 
we want to discuss only the principle of singlet-triplet mixing, we will 
restrict our discussions to (3.23). 

In order to calculate the matrix element (y273 | H! | 743), we use (3.14) and 
(3.15) and obtain 


Gee eG ee) = 1S d COnry,|Cayeay rss) IIs ae Is3>¢1s3|83 53> 


1383 5383 
$383 


x €53|S(1y 153 > 55 155 <555 |00> 
+ (Ywri31C (ayia) | Wass) (JJ3 (UT; 183) (183 | 83 55> <s3|55> 
x 53 |S.2)|55> <55> (53 55 00D) (3.25) 


Changing in the second term the summation indices s > s, 55 > 3, and 
making use of the property of the Clebsch-Gordan coefficients <45', $53 | 153) 
= (483 355| 13>; (353 483|100> = (—1)<$s5455|00 one obtains 


=) Yd <JIs|Ils 153) <153|53 83> <53]$]55> <55 5300) 
1383 $383 ° 


x {wir | Caybay PD) = (Yar, | Cpa Lay | Wass) }- (3.26) 
Since Wj), iS symmetric and },,, is antisymmetric under P{'? one has 
(wr! Cay) | Wass) = — (wry | Cayley |Ynas,)- (3.27) 


To calculate these matrix elements, one uses (2.41) and (2.42) with (2.37) and 
(2.38) and obtains four terms. Two of these terms are zero unless I' = 0,J = 0, 
and one of them is zero unless I’ = J. In order to reduce the number of terms. 
we shall restrict ourselves to the case I' 4 J = | and obtain for this case 


Onn, | Cyl) | Wass) = —<nll,|U- : (NT)C Ly U*(nJ)|nJJ3>; 
— <n lh |CayLy|n* JJ3; (3.28) 


where ican be either | or 2and U “(n'l') and U* (nJ) in here are the restrictions 
ofthe unitary operators of (2.42) and (2.41) to #°". All weneed to know of them 
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here is that they are scalar operators (because W is a scalar operator) and do 
not change the value of I'l’, and JJ,. Thus the new vectors 


In*lls>; = U*(nD|nll,); (3.29) 


are again eigenvectors of L? and L with the same eigenvalue. Inserting (3.28) 
and (3.27) into (3.26), one obtains 


(n't || Wns?) 
er y Y (JJ3| LI; 1s,><1s3|453, a) Cos Siglo ome 755 |00 


I383 S353" 
$3 


ol 1) <n Lh |CaLa|n* JJ3)>;, (3.30) 


where C,,) is an L,)-scalar operator. Liye K = 0 + 1, are L,, vector operators. 
Thus 


Mn EG | Cala. | JI3>; = IIs 1K|L1,> <n’ LI Cy Lalla? D>. 3.31) 
The reduced matrix element can also be further calculated using (V.3.10): 


CAL CaLealln* JD = bp/IS + YXn'[|Calin*>;. (3.32) 


Further evaluation of the quantity <n’ ||C,,)||n* >, requires actual calculation 
using perturbation theory with W as the interaction Hamiltonian or other 
approximation methods for the determination of |n*//;) and then cal- 
culation of the matrix element ¢n’~ ||C,,||n* >, using (3.24). We shall not do 
this calculation here but just remark that these quantities are of the same 
order of magnitude as the fine-structure splitting terms which are given by 
<n!~ICwlin” >. 
Inserting (3.31) and 


X53 /S(xl53>i = 333 1x|353>./23 (3.33) 
into (3.30), one obtains 


GrelH' War? > = Dd <II3 1k LI3> (355, 253 100) 


K 1383 sgsz' 53 


x <$55, 38 | 1s3> (3383 Lx |255> (LI, 183 |JJ3> 
x /2dn'~ LI|CwLalin* J). (3.34) 
It is now a straightforward though slightly tedious calculation using the 


symmetry properties (V.2.48) -(V.2.53) and orthogonality relations (V.2.14), 
(V.2.15) of the Clebsch-Gordan coefficients to calculate (cf. Problem 4) that 


Ee ye 19 25 / IT 1) <at CHIR. B.35) 


Therewith we have expressed the matrix element of the spin orbit interaction 
between singlet and triplet states in terms of a quantity which (can be only 
calculated numerically) is different from zero and of the same order as the fine 
structure splitting term. 
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Inserting this quantity into (3.16) and (3.17) we see that the eigenvectors of 
the operator H = Hy + H,, #77? and WJ?, are not pure triplet and singlet 
states, but have a small component in the singlet and triplet space, respectively. 
Thus, an orthohelium state 7 is mainly triplet y with a small singlet component 


F=y+ ep, le| <1, (3.17') 
and the parahelium state w is mainly singlet yy with a small triplet component 
w=w'tey’, lel <i (3.16’) 

The dipole matrix element between para- and orthohelium is therefore 


(Y', dj) = (w'dy) + eC’, dp) + €(y'dy) (3.36) 


and even though the first term is zero according to (3.13), this matrix element 
is different from zero due to the “singlet—triplet mixing.” 

We had mentioned above in Section VII.1 that it was impossible to verify in 
the spectra of one-electron atoms whether the energy values were the ex- 
pectation values of the energy operator in a state which is not an energy 
eigenstate or the eigenvalues of the energy operator. From the helium spectrum 
it follows that the latter is the case; the physical para- and orthohelium states 
are eigenstates of H = H, + H, and not pure singlet and triplet states with 
the energy values being the expectation values of H between these pure 
singlet and triplet states. The existence of the transitions between para- and 
orthohelium states is an experimental verification that the physical states are 
eigenstates of the energy operator. In Chapter XII we will see that for theoreti- 
cal reasons all stationary states have to be eigenstates of the energy operator. 


XI.4 Doubly Excited States of Helium 


The energy levels E,,,(1) of the helium atom are experimentally very well 
known. They have beén measured by light spectroscopy, i.e., by measuring 
the wave number (or frequency) of the emitted or absorbed light in a transi- 
tion between two energy levels according to 


l 2 
Veh at > ae (E, nD) a Ey xD). 


The energy levels have also been measured in energy-loss experiments as 
described in Section II.4, and as depicted schematically in Figure I1.4.1. 
The collision chamber was filled with helium gas, and the intensity of the 
electron current was measured as a function of the energy lost in collisions 
with the helium atoms. The results of this experiment are shown in Figure 
4.1, where the intensity J has been plotted as a function of the energy loss E. 
According to the interpretation of this experiment in Section IT.4, the energy 
lost by the electrons is used to excite the atom and cause the transition from 
the ground state 1 'S to an excited state; therefore the intensity shows maxima 
at values of energy loss that agree with the energy difference between the 
ground state and one of the states excited by this experiment. The excitation 
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Table 4.1 Excitation energies of helium 


Excitation energy (eV) measured in 


light spectros- energy-loss 
Transition copy experiment experiment 
Se2°P BVM 2! 223 
ae 23.081 23.084 
4° P 23.736 23079 
Se 24.039 * 24.034 
oP 24,205 24.209 
(ise 24.304 24.308 
Sar 24.369 24.374 
9'P 24.413 24.419 
10°P 24.445 24.451 
WE 24.469 24.474 
ee 24.487 (24.493) 
2s 20.610 20.612 
2527 P 60.123 60.120 
253p'P 63.651 63.651 
2s4p'P 64.462 (64.450) 


energies that are observed in this experiment are tabulated in Table 4.1, 
column 3, and are compared with the energy of the n' P levels measured by 
light spectroscopy (column 2); the first column gives the energy levels that 
participate in the transition. Not given in the table are the bumps between 
42-eV and 46-eV energy loss. These bumps are the result of two successive 
excitations (double scattering); the electrons in the collision chamber 
scatter inelastically twice, losing energy to the transition 1 'S > 2'P and 
to the transition 1 'S — n'P, or two times to the same transition | 'S§ = 3 'P. 
From Figure 4.1 we ste that there is a very weak transition 1'S > 2'S, 
and that the transitions of high intensity are the 'S > 'P transitions (dipole 
transitions). No transition between the 1 'S ground state and the 'D, 'F,... 
states or between the 1 'S state and the triplet states are observed, which 
agrees with what one would expect for dipole excitations. 

Figure 4.1 shows that there are further bumps above 60 eV; these bumps 
look different from the lower-lying bumps and have a typical asymmetric 
profile. They correspond to energy levels that are well above the ionization 
energy for one electron. Energy levels with such a high energy above the 
ground state level have also been observed as spectral lines in the optical 
(ultraviolet) absorption spectra. Figure 4.2 shows a photograph of the 
absorption spectrum obtained for helium in the region 160 215 A. The 
observed discrete spectral lines are superimposed upon a continuous ab- 
sorption background. The lowest-lying discrete spectral line in this region 
is at 206.21 A, corresponding to an energy of 60.1 eV: this agrees well with 
the position of the first bump of this kind in the energy-loss spectrum of 
Figure 4.1 (cf. also Table 3.1). It is the lowest-lying line of a single long series 
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of discrete spectral lines of outstanding intensity that converge to the energy 
value 65.4 eV. (Some of these lines have corresponding bumps in the energy- 
loss spectrum of Figure 4.1.) Three fainter lines indicate the existence of a 
second series that also converges towards this energy value. A number of 
fainter lines can be seen in Figure 4.2 lying to shorter wavelengths from the 
most prominent series. These lines are grouped into series having as limit 
points the energy values 72.9 eV and 75.6 eV. Thus we have a series of series of 
spectral lines. As these lines are observed in the absorption of electro- 
magnetic radiation by helium atoms in the ground state, we can safely 
assume that (at least for the high-intensity lines and probably for the fainter 
lines too) they correspond to dipole transitions and that (according to the 
selection rules for dipole transitions) the corresponding excited energy 
levels belong to singlet states with orbital angular momentum 1, i.e., are 'P 
states. 

The threshold for double ionization of the helium atom—t.e., the energy 
value E,,,, at which both electrons have just dissociated from the helium 
nucleus and the physical system is in a state consisting of the nucleus and 
two electrons, both with relative energy 0—lies 79.0 eV above the ground 
state. We stated this experimental fact before by saying that the ground- 
state energy E{, = —79.0eV. [The energy scale in Figure 3.2 gives E(level) — 
Ey, .] 

The observations in Figure 4.2 indicate that one has a series of series of 
'P energy levels, and it appears that the limit point of this series of series lies 
79.0 eV above the ground-state energy, i.e., at the threshold for double 
ionization. The limit points for each single series are then seen to agree with 
the energy values E,,, as calculated from (2.33): 


1 1 
Eno ~ Ei, = —Rie a + 79.0 eV = (—54.4eV) | + 79.0eV; (4.1) 


hence 


Eo — Et, = 24.6 eV, 
E,. — Et, = 654 eV, 
Ex. — BE; eeeiey 
E4o — Et, = 75.6 eV. 


(4.2) 


Thus the observations in Figure 4.2 shows that the discrete 'P energy levels 
converge to the nth ionization threshold of the helium atom, Le., to the 
energy value at which one electron has the principal quantum number n and 
the other has just dissociated itself from the helium atom. These experi- 
mentally measured 'P levels above the first ionization level are depicted in 
Figure 2.3. Also depicted there are all the other known (1971) discrete 
energy levels of the helium atom. We see that the energy-level diagram 
below the second, third, etc. thresholds is qualitatively very similar to the 
energy diagram below the first threshold. 
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A rough explanation of the discrete energy levels above the first ioniza- 
tion threshold is now very simple:5 Whereas the energy levels below E,,, 
correspond to states in which one of the electrons is always in the ground 
state (n = 1), the energy levels below E,,,, n > 1, correspond to states in 
which one of the electrons is in the state with principal quantum number n and 
the other is in a state with n’ > n. Both electrons are excited, the atom is 
doubly excited. In the approximation where the interaction W between the 
two electrons is neglected, these are the states that correspond to the vectors 
(2.15+) with n' > n > 1. For the case n = 2, (Zea) teaas 


1 
Woy, n'Vls = va @ |n'l'l5>, + |n'l 3>1@|211,>2, (43+) 


1 
V2us.0ti, = fas @ |n' lls, — |n' I'l), @[2113>2,  (4.3—) 


where / can take the two values | = 0 and / = 1; i.e., we have the singlet 
States Wo tts n't'ts and Woo0.n't'ly and the triplet states Pits n't ts and Yo0.n't's: 
If we want to explain the 'P levels, we have to couple the angular momenta 
! and I’ to obtain eigenvectors of total orbital angular momentum L? = 
(L, + L,)? with eigenvalue L(L+ 1) = 1(1 + 1). As L can be any of the 
values /!+/,1+ Il —1,...,|1—T'|, we can obtain L = 1 in the following 
cases: 


Pap J] OF FH 1) L=let x= —1 
ase 1 = ey r= —1 
; ; (4.4) 
2pyp t= 1 P=1 EPS=ti+f—-1 «=+1 
2pnd l=1 Y=2 L=!l+l-2 n=-1 
where 7 is the parity, x = (—1)\(—1)". Therefore one has three possibilities 
to obtain 'P states with parity z = —1 as linear combinations of the states 
GS i): 
C= a . (4.5p) 
(ee er )), (4.5s) 
(= amen, (2): (4.5d) 


All of these linear combinations are eigenvectors of Hoo with the same 
eigenvalue E°°. The energy shift due to the interaction of the two electrons 
depends not only on n’ and L but also on / and I’; one would therefore expect 
three different energy values for every value of n’ (except for n’ = 2). Thus 
one would expect there to be three series of energy levels converging to the 
E,,, limit. Figure 4.2 shows only two series, one with a very high transition 


* Extensive theoretical calculations of these energy levels by various approximation methods 
have been performed and agree with the observed values; see J. Macek, J. Phys. B 1, 831 (1968) 
and references therein. See also U. Fano: Doubly Excited States of Atoms in Atomic Physics 
edited by V. W. Hughes et al., Plenum, New York (1969) 
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probability from the 1'S ground state and the other with a very low one. 
This indicates the existence of an approximate selection rule that cannot 
be explained on the basis of the assumption that the spaces spanned by (4.5p), 
(4.5s), and by (4.5d) are good approximations to spaces of physical states. 
It is believed that linear combinations of (4.5p) and (4.5s) span, to a good 
approximation, the spaces of energy eigenstates that belong to the two ob- 
served series,° and that (4.5d) corresponds to a still fainter and therefore yet 
unobserved series. 

The situation becomes even more complicated for n = 3,4,.... With 
similar arguments the *P series can be discussed. 

We conclude the discussion of the double excited states with a remark 
concerning the broad profile of the spectral lines corresponding to these 
states, which is shown in Figure 4.1 and is also visible in the optical spectra. 
These energy levels lie above the first ionization threshold. Therefore, in 
addition to the doubly excited states with n = 2, n' > 2 there are states with 
the same energy as the energy of the doubly excited states which belong to 
the system that consist of the He* ion in the ground state n = 1 and one 
electron (He* + e). The system He* + e can take any possible energy 
value E > E,,,, because the relative energy between He* and e can take 
any value, i.e., the energy spectrum of He*(n = 1) + e is continuous above 
E,..- Therefore each energy value of the doubly excited states coincides 
with one of the energy values in the continuous spectrum of the states 
of the system He*(n = 1) + e. Thus there are two different states with the 
same energy: the discrete state n = 2,n’ > 2, and the state of the continuum 
with n = 1 anda relative energy between He*(n = 1) and e. These two states 
mix and interact with each other, as a result of which the doubly excited 
state cannot persist and easily decays into the state of a He *(n = 1) and an 
electron. The lifetime of such doubly excited states is therefore much smaller, 
and the spectral line has the typical broad profile. Such states are often called 
resonances, . 

For the doubly excited states with n = 3, n’ > 3, there exist two continuum 
states with the same energy, namely He *(n = 1) + e and He‘(n = 2) + e. 
For n = 4,n’ > 4 there are three, and so forth. 


Problems 


1. Calculate the length of the vectors 
oqo Cyt = »y Plc ® lc2> ®--- @lew> 
P 


and 


ley C30 Gye = ¥ (-1PP E> ® I¢2> ®---@len, 


® J. W. Cooper, U. Fano, and F. Prats, Phys. Rev. Lett. 10, 518 (1965). 
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where the |¢;> are normalized basis vectors in the space #, and the sum runs over all 
permutations P. The factor (—1)? is +1 or —1 according as P is an even or odd per- 
mutation. 


2. Show that the symmetric and antisymmetric parts of #? are given by 
HX = (HL @ 25) @ (HA @ 2°), 
H = (HE B22) @ (HE @ 2%), 
respectively. 
3. Check whether H' given by (3.23) commutes with the operator of total orbital 
angular momentum square L?. 
4. Using the properties of the Clebsch-Gordan coefficients given in Section V.2, 
perform the calculations that lead from (3.34) to (3.35). 


CHAPTER XII 


Time Evolution 


In this chapter the basic assumption on time evolution is introduced. The 
Schrédinger picture, the Heisenberg picture, the Dirac picture, and their 
relations are discussed. Time-dependent external forces are only briefly 
mentioned at the end of the chapter. 


XII.1 Time Evolution 


Until now we have not mentioned time and have completely ignored the 
time evolution of the state of a system and the time development of an 
observable. We have restricted ourselves to properties of systems measured 
at one instant. In general, one should specify not only the physical values 
obtained in a measurement but also the time at which the observation is 
made. In this chapter we consider the time evolution —the dynamics —of 
physical systems. Although the notion of one instant of time makes as little 
sense physically as the notion of a definite position (measurements always 
require a finite amount of time in the same way that they always require 
a finite amount of space), we make the idealizing assumption that time is a 
continuous real parameter f that labels a sequence of states. (Note that we 
have called time a “ parameter” as opposed to an “observable.”) That is, we 
make the idealizing assumption that measurements can be made at a succes- 
sion of arbitrarily small time intervals, and that at any instant of time the 
specification of a complete set of observables is possible. 

274 


Time Evolution 275 


Let us denote by W(to) the state of a system that is prepared at the time 
t = to. We then expect that W(t)) will uniquely determine another state 
W(t,) at any later time t = t, > ty. During the evolution the system may be 
subject to external influences, so that the manner of evolution will depend 
upon the interval from fo to t;—its starting time ty as well as its length 
t; — to. We first consider systems that evolve undisturbed by external 
influences (isolated systems). For such systems the evolution 


W(to) > W(t) (1.1) 


does not depend upon ty but only upon the system itself, the initial state 
W(to), and the length t, = t, — to of the time interval from tg to t,. Our 
intuitive understanding of the time evolution suggests that we impose the 
following requirements on the map (1.1): 


1. Expression (1.1)isa linear transformation of the set of states, continuous 
in a certain sense: If W,(t 9) > W,(t,) and if W,(t,) > W,(t,), then for 
a,beC, 

AW, (to) + bW,(to) > aW,(t,) + bW,(t,). 


2. Timeevolution forms an additive semigroup, ie.,if W(t.) > W(t o + 7,) 
fort, =t, — to and if W(t,) > W(t, + t2) for t. = t, — t,, then 


W(to) > W(to + Tt, + 72). 


Requirement | expresses the physical statement that first mixing the states 
at time tf, and then letting the system evolve in time leads to the same results 
as first letting the states evolve separately in time and mixing them afterwards 
at time f,; the continuity of the transformation expresses the intuitive 
statement that time changes continuously. Requirement 2 is intuitively 
obvious; it is a statement of the fact that letting a physical state evolve first 
for a period of length t, and then letting the resultant state evolve for a 
period of length t, leads to the same result as letting the original state 
evolve for a period of length t, + t,. Requirements 1! and 2 are very general 
and also include irreversible processes. 

Weare here concerned with the quantum mechanics of reversible processes, 
and for such processes we make the additional requirement 


3. The time evolution (1.1) is given by a unitary operator U(t) satisfying 


W(to) > W(t,) = U'(t1)W(to)U(t,), (1.2) 
GU ~ (2), (1.3) 
U(0) = 1 (1.4) 
Uz) = U(—1), (1.5) 


U(t, + t2) = U(t,)U(t2) (—0 <%,1,< +0), (1.6) 


U(t) is a continuous operator function’ of the param- 
eter T. (1.7) 


' See appendix to this chapter. 
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Equation (1.5) follows from (1.2) and (1.3)—specifically from 
U(t,)W(t,)U~*(t1) = W(to) (1.2’) 


—and from the assumption that (1.2) is also valid for evolution backwards 
in time, i.e., 
W(to) = U-'(—11) W(t,)U(— 1). (1.8) 


Equation (1.6) follows from requirement 2, which, when the time evolution 
is described by a unitary operator as in (1.2), is expressed as 


U(t, — to) = U(t, =i) JUG —1,)- (1.6’) 


Equation (1.6) follows for t; =.t,; — to, t2 = tz — t,. The linearity of the 
transformation (1.2) is obvious. The continuity of the transformation (1.1) 1s 
expressed by (1.7). We shall not give here the mathematical definition of a 
continuous operator function, but mention only that the formal operations, 
employed below, of differentiation, Taylor series expansion, etc.” can be 
rigorously defined. A continuous operator function satisfying (1.6) is called a 
one-parameter group of operators. 

We differentiate U(t) with respect to t and define 


(1.9) 


dt = 
A is called the infinitesimal operator or generator of the operator U(t) of 
time evolution. U(t) is not only “differentiable,” but is differentiable an 
arbitrary number of times..We can calculate all “derivatives” using (1.6). 
Differentiating (1.6) with respect to 7,, 


dU(t + 7) d(t + 7) _ dU(z,) U 


dc <a, i 
and evaluating the resultant expression at t, = 0, we obtain 
dU 
© _ Aun) 
dt 


We differentiate this again and obtain 


PU(e)_ dU) 


= A- AU(z) = A-UG). 


dt? dt 
We may continue to differentiate, obtaining for the pth derivative 
d?U(t) 
—PATUCLy: 
de (t); 
use of (1.4) then gives 
d?U(t) _ yp 
ae t=0 


* See the appendix. 
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The “Taylor series expansion” of U(t) is therefore given by 


os ae tv n . 
UM=a1+ath a+ tg a +s (1.10) 
or, if we compare (1.10) with the series for e*4, we have 
Ug (1.10’) 


In particular this means [according to our rule that f(A)|a> = f(a)|a> 
for functions f(A) of A] that 


U(t)|a> = ea) 
for (generalized) eigenvectors |a> of A corresponding to the eigenvalue a. 
Taking the adjoint of (1.10) gives 
2 
te = fe tee erst 
U(r) =1+7A on +o =e,” (1.11) 


If we use (1.3) and differentiate U'(z)U(t) = I, we obtain 


sb is aoe 


Uz) + UG) —— = 0; 


setting t = 0, we obtain 

At = —A. a lel) 
An operator that fulfills (1.12) is called skew-Hermitian. As t is the time, A 
has the dimensions of frequency, i.e., of energy/h. The operator 
dU(t) 


H = —ihA = —ih > 
dt t=0 


(1.13) 


often also called the generator of time evolution, is therefore a Hermitian 
operator with the dimension of energy. H is the Hamiltonian operator or 
energy operator of the system, and, from (1.10’), 


U(t) = eit, (1.14) 


What H is depends, of course, upon the particular physical system under 
consideration, and how it is found in particular cases has been discussed 
previously. H is an element of the algebra of observables of the particular 
system, and its properties are determined by the defining algebraic relations 
of the particular physical system. We remark that (if one uses a precise 
mathematical formulation—something that has not been done here) 
one can show that on the space of physical states there is a one-to-one 
correspondence between U(t) and H; one can therefore consider H as a 
quantity derived [using (1.13)] from the more fundamental time-evolution 
operator U(t), or one can consider H as the more fundamental quantity 
and the time-evolution operator U(t) as a quantity derived [using (1.14)] 
from H. It is in the spirit of our preceeding formulation to consider H, an 
element of the algebra of observables, as the fundamental quantity, and to 
consider U(t) as the derived quantity. 
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We formulate the result of our preceeding plausibility argument as the 
basic assumption of quantum mechanics for time evolution—the basic 
dynamical law for quantum-mechanical systems: 


Va (The Schrédinger Picture). A (conservative) physical system has a 
generator H of time translation, which is a Hermitian element of the algebra 
of observables and which is characteristic of the physical system. The time 
evolution of the state of the physical system is given by 


W(t) = UT()Wm U(t), (ay 


where W, is the state of the system at some initial time tg = 0 and where 
U(t) is given by 
Oye (1.16) 


According to the axiom II, the expectation value of an observable A is given 
by 
<A> = TtAW), (1.17) 


The observable that is represented by the operator A is defined by the pre- 
scription for its measurement, i.e., how to set up a measurement apparatus 
and how to take the measurement. This prescription is given for all time and 
does not change with time. The experimentally measured value changes in 
time; if we make a measurement at a time fy, the expectation value <A), -,, 
will, in general, be different from the expectation value <A),-,, at a different 
time t,. This experimental fact is represented by a description of the change 
of the state W(t) according to (1.15). Inserting (1.15) into (1.17), we obtain 


<A>, = Tr(AW(t)) = Tr(AUt(t)WU(t)). (1.18) 
This can be written 
; (A>, = Tr(U(AUT(1)W), (119) 


since the trace of a product of operators is invariant under a cyclic permuta- 
tion of the order of the factor operators. We can now define a new operator 
A(t) by 

A(t) = U(t)AU‘(t) (1.20) 
and write (1.19) as 


<A), = (AW) = Tr(A@W). (1.21) 


In (1.21) the change of the experimentally measured value <A), with time is 
described in a form that can be interpreted in the following way: The state 
of the system is described by an operator W,; this statistical operator Wo 
does not change with time, i.e., the whole history of the state of the physical 
system 1s described by the one operator Wy. The observable develops with 
time according to (1.20), 1e., the prescription for the measurement must 
include instructions as to when to take the measurement, because the ob- 
servable -the experimental setup—is different for different times. Thus 
instead of the basic assumption Va we can formulate the time evolution by 
the following basic assumption: 
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Vb (The Heisenberg Picture). A (conservative) physical system has a 
generator H of time translation, which is a Hermitian element of the algebra 
of observables and which is characteristic of the physical system. The time 
development of every observable A(t) of the physical system is given by 


A(t) = U(t)AU(t), (1.22) 


where A describes the observable at some initial time t) = 0 and where U (t) 
is given by 


U(t) = eth. (230 


These two descriptions of the change in time of the physically measured 
value <A), are, of course, completely equivalent. In the form Va the change 
in time is described as a change of the state of the system. The description of 
the change in time in this form is called the “Schrédinger picture.” In the 
form Vb the state is held fixed and the change in time is described by the 
change of the observable. The description in this form is called the “ Heisen- 
berg picture.” 

The time development of an observable as given by (1.22) is often written 
in a different form, which is obtained from (1.22) by differentiating: 


dA(t) a i HeitHih 4g itHin 4 pitting ( _ i H)e7ithik 
dt h h 


= + (HAC) — A(t)A), 


or 


i gat) = {ACH |: (1.24) 
dt 
This is the “Heisenberg equation of motion.” An observable is a constant 
of the motion if its does not depend upon t. From (1.24) it follows that all 
constants of the motion commute with H. In particular, H itself is a constant 
of the motion. Furthermore, every member of any c.s.c.o. that contains H is 
a constant of the motion. 

Consider the particular observables momentum P; and position Q; for a 
physical system in which neither of them is a constant of the motion. It is 
easily seen that [P,(t), Q;(t)] has its usual form for all times t > 0 given 
that [P;, 2] = (h/i)6,,I at t = 0: 


Pi(1)0,(0) — O,(0)PA0) = UQP,U(QU(DG, UN) — UNO,UQUOP, UN) 
Sac )(P,0, 0, P)UG) 


he 


(P(t), 0,(0)] = 2 5,1. (1.25) 


i 


280 Time Evolution 


We are now in a position to calculate the equations of motion for the 
momentum and for the position and to show that these equations have the 
same form as those for the classical quantities. For a system whose 
Hamiltonian is given by 


1 
H = eee A); 
2m 


where 


Q; = 0X0 and P; = P(t), 


we calculate that 


1 1 
[Q,, H)= Fm Loe P;P;] a Byte aie P;} 


ih ih 
oe i Pee a pe 
Im wil, P;} m Lee 
Inserting this into (1.24), we obtain 


dQ,(0) _ Palo) 


ae le 
dt m C2 


This is the quantum-mechanical analogue of the classical relation between 
momentum and the time derivative of position. 
In classical physics the force is obtained from the potential function 
V(x) by 
OV(x) 
Ox 


= 


In analogy to this we define the “force operator” by 


“2V(Q)” 
10, 


where the last symbol is defined below. Inserting this into (1.24) for A(t) = 
P,(t), we obtain 


F, = —;[P, VQ) = - (1.27) 


dP,(t) 1 PH i 
dt > ip LPe HI = — 7 Pe VQ) = FAQ). (1.28) 


This is the quantum-mechanical analogue of the classical relation between 
the time derivative of the momentum and the force. Combining (1.26) 
and (1.28) gives the quantum-mechanical Newton’s equation, 


ad? 
mn “ ) — FQ). (1.29) 
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[“eV(Q)/0Q,” is a symbolic way of indicating the following pro- 
cedure: Replace Q; by the numbers x; in V(Q) [recall that this is just 
the inverse to the way V(Q) was obtained from the corresponding 
classical expression], differentiate V(x) with respect to x,, and 
replace x; by Q; in 0V(x)/dx,. That “6V(Q)/0Q,” is in fact (i/h) 
[P,, V(Q)] can be seen in the following ways: 


Ne 


Write 
V(Q) =C+ C;Q; sf OO OF + --- 
aR Ci, Qi, dO aa aE) (1.30) 
where the C;,, ...;, are symmetric, i.e., 


(1,31) 


ict igs tip in? 


because the Q; commute with each other. Using the canonical 
commutation relations (1.23), calculate that 


5 EPL. VQ) = Gy + 26,0; + 


+ NCyi i, ---i,Q0;,0i,°°°Q;, + +++, (1.32) 
because 


Ci, a Uae Q;, aad Q;,,] 
a C;, in 2 UPi. Q;,10:, oe OG, One a Q,, 


n 


h 
= Crear De 5 Onin Qi peers 0. 207. aa 0,, 
p=1 


} 
Sol 5 Cre ie, Ci, - Cis 
where (1.31) has been used. The right-hand side of (1.32) is 
identical to “@6V(Q)/dQ,.” 
Calculate [P,, V(Q)] in the position representation: 


CxI[P,, VQ) = <xIPLAV(Q)IY>) — <x VIQ)P. IY 
h oO 


h oO 

= aS «x [V(Q)IY> — Oe <x|p> 
h od h o 

= in = BO x, «xl 


_ hOVv(n) 
ou ein 


Kxly>. | 
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As the Heisenberg commutation relations do not change with time, all 
algebraic relations that are derived from them are also valid at any time, 
even if the operators in these algebraic relations do not commute with H. 
That is, any algebra of observables whose elements are functions oP, 0; 
does not change in time, even though some of the elements may not commute 
with H. 

A symmetry transformation of a set of elements is a transformation of the 
elements that leaves the relations among them unchanged. Thus, the time 
evolution of a physical system whose algebra of observables is derived from 
the Heisenberg commutation relations is a symmetry transformation of the 
algebra of observables. If time evolution is a symmetry transformation, then 
the mathematical structure (in particular the algebraic relations) of the 
algebra of observables does not change in time; this means that the physical 
structure is indistinguishable at two different points in time. Our experience 
shows that there are physical systems that have this property, and in fact it is 
this property that defines the isolated physical systems. Thus isolated physical 
systems do not age, an absolute value of time has no meaning for these 
systems, and only time differences are accessible to measurement. Irreversible 
processes do not take place in isolated physical systems defined as above. 
These considerations lead us to another formulation of the basic assump- 
tion V: 


Ve. The time evolution of a conservative physical system is given by a 
continuous symmetry transformation of its algebra of observables. 


Let us return to the Schrédinger picture. The time evolution (1.15) can 
also be expressed in differential form. eee eae (1.15), one obtains 


dWwit i 
—— i 5 (WOH — HW(t)) == ; (WO, A}. (1.33) 
We consider now the om case where the state W is a pure state, 1e., 


Wo = Aes = IWo><Wol, (1.34) 


where Aj,,) is the projection operator onto the one-dimensional space 
spanned by the vector |». For this case (1.15) reads 


W(t) = UT()Ay,) UE) = UT) Wo> Hol UC). 
We define the vector |(t)> by 
Iy(t)> = UTH)|Wo> (1.35) 
and have 
W(t) = U'@)Awyoy UE) = [WD <Ol = Awa: (1.36) 
We obtain the differential form of (1.36) by differentiating (1.35): 


d dU 
4 Woy = A te = - L HUD Ive = - + HIVeOY. 
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This is the Schrédinger equation for the state vector |W(t)>: 


d 
ih 7 |W(0)> = H|v>. (1.37) 


For the Hamiltonian given by 


the Schrédinger equation in the position representation reads 


d 1 
ih 7, SXIWOD = 5 <xIPiPilWO> + VOO<x|Y(OD. 


Using 


hh o 
(xl PPM) = 22 = © CxIM(o 


(see Equation (II.8.64)), the Schrédinger equation may be written in the 
conventional form 


od h? _, 
ih 5, VO 0 = — 5 VU, 1) + VOW, 0, (1.38) 
where we have defined the time-dependent wave function 


W(x, t) = <x| (>. (1.39) 


Summarizing, the distinction between the Heisenberg picture and the 
Schrodinger picture is the following: In the Heisenberg picture the observa- 
bles are transformed by U(t) and the states are kept fixed, while in the 
Schrodinger picture the observables are kept fixed and states are transformed 
in the “opposite direction,” i.e, by U'(t) = U~‘(t). The time dependence 
of Tr(AW) is the same in both cases. Expressed in terms of vectors: In the 
Heisenberg picture the eigenvectors |a t) of the observables A(t), 


A(t)|at) = alat), (1.40) 
are “rotated” by U(t): 
U(t)|a0> = lat), 


whereas the state vectors |~)> are kept constant. [Note that in (1.40) no 
time dependence occurs on the part of the eigenvalue a. This is easily shown: 
A(t)|at> = A(t)U(t)|a0> = U(t)A(0)|a0> = U(t)a(O)|a0> = a(O)|at>.JIn 
the Schrédinger picture the eigenvectors |a0> are kept constant and the 
state vectors are “rotated” in the opposite direction: 


U'()|Wo> = |W(t)>. 
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The time dependence of the physically measurable quantities representing 
the probabilities is in both cases the same: 


[XatlWo>l? = |(U(|4a0>, |Wo>)? 
= |<a0|U'() Wor? = |<a 0|W(O)>/?. 
A state is called a stationary state if it does not change with time: 
W(t) = UTM Ut) = W = W. (1.41) 


This terminology is justified by the fact that every observable measured in 
this state gives a value independent of the time of the measurement; for from 
(1.41) it follows that the expectation value of any observable A is 


<A), = Tr(AW(t)) = Tr(AUt(t)WU(t)) = Tr(AW) = <A), <o- 


Stationary states are always mixtures of eigenstates of the energy operator H. 
This follows from (1.41) in the form 


Weithih = cttw 
or equivalently, 

WH —- HW=0. (1.42) 
In particular, for a pure stationary state, 


W= Aiwoy = lWo><Wol, 


(1.42) reads 
lWo><WolH aa A\ho><Wol, 
or ; 
_ WolH|Wo> 
; H|%o> = |Wo> aT (1.43) 


Thus pure stationary states are represented by eigenvectors of the energy 
operator. The time dependence of a stationary state vector is given by 


IW()> = UTO|Wo> = ec" |Wo. (1.44) 


Stationary systems are systems that have only stationary physical states. 
Stationary systems can therefore only be prepared in states that are mixtures 
of eigenstates of the energy operator, ie., in states that fulfill (1.42). And 
pure physical states of stationary systems must always be represented by 
energy eigenvectors. In the preceeding chapters all physical systems con- 
sidered were stationary physical systems, and we have now found an “explana- 
tion” for the fact that these systems always appeared in eigenstates of the 
energy operator. 

To what extent a given physical system may be considered stationary 
depends upon the desired accuracy of the description. If one ignores details, 
atoms and molecules are stationary physical systems. If a measurement 


Time Evolution 285 


process has prepared them to be in a particular state, which has to be in an 
energy eigenstate, then they will for “all time” remain in that state. “All 
time” means fairly long on atomic scales: 10~? sec (for infrared transitions) 
to 10~° sec (for ultraviolet transitions) for the excited states of atoms and 
molecules. If one wants to describe not only energy levels but also details of 
the decay process of excited states, then one cannot consider the atom as a 
stationary system. This will be discussed in later chapters. 

The Heisenberg picture and the Schrédinger picture are the limiting cases 
of a more general picture, which is called the “Dirac picture” or the “inter- 
action picture.” One writes the Hamiltonian operator H as the sum of two. 
parts, 


H = Hy, + Tie (1.45) 


How this splitting is performed, i.e., how one makes the choice as to which 
part of H is to be Hy and which part is to be H,, depends upon the particular 
situation. In principle any part of H can be called Ho and the rest then called 
H,. In practice one chooses for Hy an operator whose eigenvalues are known 
or can easily be obtained. Often Hy will represent the energy of two or more 
combined systems when the interaction between the systems is ignored; 
H, then represents the interaction energy. 

Assuming that H is not explicitly time-dependent (a conservative system), 
we define, in accordance with (1.16), 


U@)=e7" (1.46) 
The expectation value of an observable A at time t is then given by 
<A> = In U()AU (OW), (1.47) 


where A represents the observable at some initial time tg = 0, and where Wo 
is the state of the system at that initial time. If we define operators 


U p(t) = eitHols (1.48) 
and 
UG@® = UOUL®) = Ube ™, (1.49) 
then (1.46) may be rewritten 
<A>, = Tr(U (t)U (AU (UT ()Wo) 


= Tr(Uo(AU GU (Wo U (0) 
= Tr(A?(t)W(t)), (1.50) 


where 

A(t) = U,(t)AUG(t), Cio!) 
and where 

W(t) = UIHWU, (0. (1.52) 
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Note the similarity between the time development (1.22) of A in the 
Heisenberg picture and the time development (1.51) in the Dirac picture. 
In the former the evolution operator U(t) = e'“#" is generated by the full 
Hamiltonian H, while in the latter U,(t) = e“¥°/* is generated by only part 
of the Hamiltonian, namely Hy. In the extreme case that Hy = Hand H, = 0 
we have U,(t) = U(t) and U,(t) = IJ, so that (1.50), (1.51), and (1.52) re- 
duce to the Heisenberg picture. Alternatively, there is a similarity between 
the time development (1.15) of the statistical operator W in the Schrédinger 
picture and the time development (1.52) in the Dirac picture. This similarity 
is even more pronounced if . 


[Ho, H,] = 9, (1.33) 
for then (1.49) becomes 
U,(t) = eit /he — itHo/h Pas eit Halk (1.54) 


In the case (1.15), then, U(t) = e” is generated by the full Hamiltonian 
H, while U,(t) = e'”*’" is generated by the interaction part of the Hamilton- 
ian, namely H,. In the extreme case Hy = Oand H, = H, we have U,(t) = I 
and U,(t) = U(t), and we thereby regain the Schrédinger picture. 

Suppose W, describes a pure state, 


Wo = |Wor<Wols (1.55) 
then we can define a vector 
WO>n = UiM|Yo> (1.56) 
such that, according to (1.51), W(t) is expressed as 
W(t) = |W)» ov <WO)I- (1.57) 


Differentiating (1.56), one obtains for the most common case that (1.53) 
holds: 


d 
ih = WO) = Hil). (1.58) 


Thus the vectors representing the state in the Dirac picture obey the 
“Schrodinger” equation with the interaction Hamiltonian H,. 

In the interaction picture the time dependence of the physically measurable 
quantity (A), is described by letting both observable and state vary with 
time. The observables are rotated in one direction with a transformation 
generated by part of the Hamiltonian, and the states are rotated in the op- 
posite direction by a transformation generated by the other part (the inter- 
action part) of the Hamiltonian. Expressed in terms of vectors: The eigen- 
vectors |at)>p of the observable A, 


A*(t)|a t>p = a(t)|at)p, 
are given by a rotation by e"”° of |a0)p, 


|at>p = eM" a0>p; 
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one obtains the state vectors |Y/(t)>p by rotation in the opposite direction by 
Ui() (=e "if TH, Hy] = 0) of |W): 


IY@)>n = UL(O|Wo>. 


The time dependence of the probabilities is the same as in the case of the 
Heisenberg and Schrédinger pictures: 


[<at|Wo>l? = |<a0| UT) |Wo>/? 
= |<a0/ UKHULO |W! 
= |p<at|W(t)>p/’. 


Isolated systems are an idealization, because a physical system cannot be 
completely isolated from the rest of the world. However, in many cases these 
idealizations give a sufficiently good approximation to the physical situation. 
If the isolation between a physical system and some part of the rest of the 
world is not negligible, one can enlarge the physical system by this part of 
the rest of the world and then consider the combined system. If that is not 
sufficiently isolated, one can continue to enlarge the physical system and will 
ultimately end up with an isolated system to which the previous considera- 
tions about time development apply. In many situations, however, this 
procedure is not practical. The enlarged system might be too complicated 
to be described by a sufficiently simple mathematical structure (if such a 
structure exists at all). In such a case it is useful to consider nonconservative 
systems, in particular systems with time-dependent external forces. 

For such systems the physical structure changes with time, and so does 
the mathematical structure that describes it. The observables, in particular 
the Hamiltonian, will depend explicitly upon time; and thus the dynamical 
laws wiil change with time. For such systems it is not possible to give the time 
evolution by a unitary operator, as in (1.15) and (1.16) or as in (1.22) and 
(1.23). But one can still try to express it in a differential form in analogy to 
(1.24) or in analogy to (1.33). The Hamiltonian operator, which is no longer 
the energy operator of the system, depends explicitly upon time. In this case 
the time dependence may be formulated in the following form, which is 
suggested from correspondence to classical mechanics: 
BAe Ap aIOI + (1.59) 

ih Ot 
where OA(t)/ot is the derivative of A(t) with respect to its explicit time 
dependence. If one assumes that every observable A is a function A = 
A(Q((t), P(t), 1) of Oft) and P,(t) with an additional explicit dependence 
upon t, then (1.59) is obtained from the corresponding equation in non- 
relativistic classical mechanics by replacing the Poisson bracket by 1/ih 
times the commutator. Equation (1.59) is a generalization of (1.24) and, in 
the case of an isolated physical system, goes over to (1.24), which corresponds 
to the classical expression for conservative systems. For systems that have 
quantum-mechanical observables that cannot be expressed as functions of 
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Q(t) and P,(t), it is not clear whether a Hamiltonian operator can be found 
such that the time development can be expressed by (1.59). Equation (1.59) 
can no longer be integrated into a form analogous to (1.22); even for the 
case that A does not depend explicitly upon time [@A(t)/dt = 0], there does 
not exist a general integration theory for the solution of 


dA(t) 1 


a aa LA(t), H(t)]. (1.60) 


XII.A Mathematical Appendix: Definitions and Properties of 
Operators That Depend upon a Parameter 


Let S be a linear space in which a limiting process is defined ; such a space 
is called a linear topological space. For example, in the Hilbert space S = # 
a limiting process is defined in the following way: A sequence ¢,, (>, $3,... 
is said to converge to dc #, denoted 


bn > o, 
iff 
l?,-— ¢]| ~0 asn—> oo. (A.1) 


The Hilbert space # is not the only linear topological space; if one has a 
linear space, one can equip it with all kinds of topologies, i.e., one can give 
all kinds of meanings to the convergence of a sequence. For example, instead 
of one norm—as in the Hilbert space -one can define in a linear space S a 
countable sequence of-norms || ||, 


IPllo S Iola < Illa <--- forall deG, 


and define convergence in the following way: A sequence of elements {¢,} 


converges to @ with respect to the topology given by the countable sequence 
of norms, denoted 


on > >, 
iff for every p = 0, 1, 2,... 


Pn as lp =a) a9. =e co. (A.2) 


Such a linear topological space S is called a countably normed space. In 
quantum mechanics one usually uses the definition of convergence (A.1), 
L.e., one uses the Hilbert space rather than any other linear topological space. 
No physical justification for this choice (made by J. von Neumann) exists. 
The notions of “continuity,” “differentiability,” etc. are defined with 
respect to a certain topology or meaning of convergence. Hence continuity 
in a Hilbert space has a meaning different from continuity in a countably 
normed space or continuity in any space with another kind of topology. For 
a given linear topological space G, then, we say that an operator function 
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A(t) (e., an operator which depends upon a parameter ¢) is continuous at 
t = to iff for any de S 


A(t)p S Alto) ast > to. (A.3) 


A(t) is said to be continuous iff (A.3) holds for all values of to. If one assumes 
that in quantum mechanics the space of physical states is the Hilbert space, 
then the vague statement made about the continuity of the time-evolution 
operator U(t) has the precise meaning given by (A.3), where 


“wo © “cH 9 
=> means ~ 


’ 


i.e., U(t) has the property 
|UOd — Ulto)Pl| +0 ast ty. (A.4) 


As mentioned in several places in particular in section II.10 there is no 
physical reason for which the Hilbert-space convergence is preferred over 
any other definition of convergence, and therefore there is also no reason to 
give the vague physical meaning of continuity the precise mathematical 
definition (A.4). One could as well have chosen any of the other topologies, 
ie., used the definition (A.3) with any other mathematical definition of con- 
vergence in GS. In the numerous derivations of the Hilbert space for quantum 
mechanics, it is usually at this point (i.e., the choice of (A.4) for the vague 
physical meaning of continuity) that the Hilbert space is sneaked in. 

An operator function A(t) is called differentiable at t = to (with respect to 
the topology on GS) in a subspace 9 © G if for any Pe D 


A(ty + At) — A(t) 
At 


i.e., if the limit of (A(t) + At) — A(to))/At)d exists in S. The vector w defines 
a linear operator which we shall denote by dA(t)/dt|,_, , 


dA(t) 


Me 


doweS asAt 0, (A.5) 


t=fo 


and which is called the derivative (with respect to the topology on GS) of 
A(t) at t = to. Thus the derivative dA(t)/dt|,_,, is defined by 


 =lim® A(t + At) — A(t) 


At>0 At 


o. (A.6) 


F— tg 
if one chooses the Hilbert-space convergence, then one writes 
dA(t) o = lim” A(ty + = — A(to) 


dt t=to At~0O 


The action of the derivative dA(t)/dt|,-,, is defined on all vectors be Z € G, 
i.e., on all vectors for which the limit (A.5) exists. 
An operator function A(t) that satisfies 


A(t, + tz) = A(t,)A(t2) (A.8) 


. (A.7) 
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and is continuous with respect to the Hilbert-space topology is called a 
one-parameter group of operators with respect to the Hilbert-space topology. 
For such a one-parameter group of operators the following properties can 
be proved: 


1. A(t) is differentiable on a “dense” subspace 9 < #. This justifies 
(1.9), not for all vectors, but for all vectors in J. 
2. On &, A(t) can be differentiated an arbitrary number of times. 
3. The sequence {¢,}, where 
. n ie d* A(t) 
Unies 2 k! dt* he 


=0 


converges (with respect to the Hilbert-space topology) for all elements 
@ of a dense subspace 9 € GC H. This justifies (1.10). 


Problems 


1. H,a, and a’ are the operators as defined by (II.3.1) and (II.2.1b). Using the 
axiom of time development (V), calculate da/dt and da‘/dt. 


2. The Hamiltonian H for a harmonic oscillator subject to an external force K(t) 


is given by 
2 2 
ee Ona OK 
2m D 

Here Ho, not the Hamiltonian H, is the total energy. 

(a) Express H in terms of the operators a, a‘ and the identity operator. 

(b) Using the axiom on time development, calculate da/dt. 

(c) Solve the differential equation obtained in part (b) and find the explicit ex- 

pression for a(t). 
3. For the harmonic oscillator under the external force K(t) in Problem 2, the 

orthonormal eigenvectors wy, of N = a‘a can be constructed in a manner analogous 
to that used in the force-free case. One finds 


i 
W(t) = a a'(t)Wo(t), 


where W(t) represents the ground state. At the time t = 0, the energy of the harmonic 

oscillator is measured to be fm/2. Denote the ground state at t = 0 by dp = vo(t = 0); 

(a) What is the statistical operator for the harmonic oscillator after the energy 
measurement? 


(b) Show that 1, = |(W,,.o)|? is the probability for measuring a value of the 

energy E = hw(n + 5) at some later time t. 
(c) Show 

2n 

Ww, = Fl e7 FOr? 
n! 
where 
F(t) = — [ Kee dt. 
\/2mho Yo 
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(d) Show that the expectation value for the position operator immediately after the 
energy measurement at t = 0 is 


it eae 
<Q> = as [xe sin (t — t) dt. 


4. The harmonic oscillator under the influence of an external force 


KG) abe Ss (0). 


Ko = 1m f7 <0 


is at the time t = 0 ina state in which the energy is hv»/2. The property that the energy _ 
at a time f is equal or smaller than hw(m + 4) is characterized by the operator 
A = yir-o Aye» Where Ay, are the projection operators on the one-dimensional 
subspace spanned by w(t). 

(a) What is the probability for the measurement of A. 

(b) What is the statistical operator W, after the measurement of A with a positive 

result? 
(c) Show that 


. iin’ 1 ae 
Wy = » py, mot _____ (F)"|F*|"e-IF?_ ifn’ < m, ifn’ > m. 
n=0 


/nin'! 


CHAPTER XIII 


Change of the State by the Dynamical 
Law and by the Measuring Process— 
The Stern—Gerlach Experiment 


As an illustration of the characteristic features by which quantum mechanics 
differs from the classical theories, this chapter discusses the change of the 
state by a measurement. In a Gedanken experiment with the Stern—Gerlach 
apparatus it is shown that a polarized beam (a pure state) cannot be split 
by the magnetic field and that the splitting of such a beam is a consequence of 
the measurement. 


XIII.1 The Stern-Gerlach Experiment 


In the preceding chapter we have formulated how the state of a system 
changes in time as a consequence of the dynamical law. In Section II.5 we 
have described another kind of change in time, the change of the state by a 
measurement. These are two completely different processes of time change. 
In this chapter we shall illustrate these processes by considering in detail a 
Gedanken experiment with the Stern—Gerlach apparatus, which was briefly 
described in Section IX.1. 

In the Stern—Gerlach experiment (Figure 1.1), a beam of hydrogen atoms 
in the ground state passes through a strongly inhomogeneous magnetic 
field H. Under the conditions of this experiment the hydrogen atoms may 
be considered to be the combination of two elementary physical systems 
(cf. the basic assumption IVa of Section II.5): Physical system I is the ele- 
mentary rotator with angular momentum 3, and describes the spinning 
electron; physical system II is the elementary particle that describes the 
motion of the structureless hydrogen atom in the experimental setup. 
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Figure 1.1 Gedanken experiment with two Stern—Gerlach devices. 


The space of physical states of system I is the two-dimensional spin space 
KH, = * = R'" (cf. Section 11.3). As a basis for #, we choose the eigen- 
vectors | + > and | — > of the spin component S; in the direction of the magnetic 
field H, with! 


S3l+>=+2/1+> and S3|—> = —}|-). (1.1) 


The magnetic moment of this system is [Equation (IX.3.18)] 


M = —2 


Fire S = —2p,S. (12) 
The energy operator is a constant; because of our freedom of choice in 
defining the zero of energy, we may set this constant equal to zero: H, = 0. 
The space of physical states of system II is the space #,, spanned by the 
generalized eigenvectors |p) of the momentum operator P or by the general- 
ized eigenvectors |x> of the position operator Q; Q and P are the position 
and momentum of the hydrogen atom considered as a whole. The energy 
operator for system II is just the kinetic-energy operator 


Ay — P?/2M, (13) 


where M is the mass of the hydrogen atom. 

System I and system II are coupled by the external magnetic field H;*" 
according to Equation (IX.3.1) and (1X.3.18) the interaction energy operator 
that represents this coupling is 


Hi. = —M-H = —(M@/)-(U @ HR) = 2883 H3(Q) = 2S, H3(Q3). 
(1.4) 
' We are again using units where fh = 1. 


'? Tn the following, we use the same letter H for the components H;, of the magnetic field as for 
the Hamiltonians H, H,, Hy, etc. 
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In (1.4) we have approximated the magnetic field H as having only a z- 
component and as being a function of z = x, alone.* Consequently the energy 
operator of the physical combination of these two systems, i.e., of the hydrogen 
atom in the ground state with electron spin in the magnetic field, is 


i 
H =H, + Hy + Hin =a + 2gS-H(Q) 


1 
= 547 P* + 253 H3(Q3). (1.5) 


In choosing this combination of I and II for the description of the hydrogen 
beam in the magnetic field we have neglected (1) the magnetic moment of 
the proton, (2) the influence of the magnetic field H upon the electron- 
proton interaction in the hydrogen atom, and (3) the difference between the 
electron position and the position of the hydrogen atom as a whole. The 
justification of (1) and (2) may be seen by an examination of the orders of 
magnitude involved: (1) The magnetic moment of the proton is [cf. 
(IX.3.18,)] M, = (2 + 3.59)(e/2m,c)S = 5.59(m,/m,)gS. Since the mass of 
the proton is much greater than that of the electron (m, = 1836m,), it follows 
that the magnetic moment of the proton is almost three orders of magnitude 
smaller than the magnetic moment of the electron; the interaction energy of 
the proton magnetic moment in the magnetic field is therefore negligible when 
compared to (1.4). (2) The Bohr magnetron pig = 5.795 x 10-° eV/gauss is 
sufficiently small that even in a fairly strong field of 10° gauss the magnetic 
interaction énergy jg H is six orders of magnitude smaller than the difference 
between the lower energy levels of the hydrogen atom. Thus the internal 
states of the hydrogen-atoms do not change significantly; and the operator 
for the internal energy of the hydrogen atom [essentially the energy operator 
(VI.3.2) of the Kepler problem] may be considered a constant for the present 
problem, which we set.equal to zero. Instead of the above described space 
H,, the space of states of system HI should be #7), = “y ® &, where # is 
the space of states for the Kepler problem (cf. (VI.5.2)]. But as we are keeping 
the hydrogen atom in the ground state [1.e., in the one-dimensional subspace 
Bn = 1) < &], the space of states for system II is in fact #y, @ # (n = 1), 
1.€., 1S Hy. 

The experimental arrangement is such that at t = 0a pulse of hydrogen 
atoms enters the region of the magnetic field at the point A (cf. Figure 1.1) 
and moves with an average momentum fp in the y direction. Idealizing, we 
assume that in this experimental arrangement a pure state of system IJ, has 
been prepared. This state is described by the state vector (statistical operator) 


IWo> = (Wr = [Wo Hol) (1.6) 
in the Heisenberg picture for all times t > 0, or by 
IWo> = (W(9) = |Wo> Hol) (1.7) 


* The Maxwell equation V-H = @H;/éx; = 0 tells us that the variation of H, with x is just 
as great as the variation of H, with z. We shall return to the subject of this approximation in 
the appendix to this chapter. 
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in the Schrédinger picture at time t = 0. System I may be in a pure state with 
statistical operator 


W=|0><b| where|d>=a1+>+Bl-—> (BEC) (18) 
if the beam has been polarized before it reaches point A. (The state |) will 
be an eigenstate of f'-S for some direction fa determined by the choice of 
a and .) Alternately the system I state may be a mixture, e.g., 


W,= AN? =H14+>¢4+1 + |-><-]) =4/, 9) 


where A‘? is one-half the identity operator in the spin space #, = *°. 
The state of the combined physical system is described by 


W=WOW, or |x>=16> @ lod, (1.10) 


the second description in terms of the state vector |7> being an alternative 
in the case of a pure system I state to the description in terms of the statistical 
operator W.”* 

In considering the time development of the pulse of hydrogen atoms we 
shall use both the Heisenberg and the Schrédinger picture. By use of 
(XII.1.59), (1.5), and the canonical commutation relations, we see that the 
Heisenberg equations of motion for the position and momentum operators 
are 


d 
Se = = [Q(0, HiO] = 42 POO, (41) 


dP(t) 1 ae = a 
en fea HO)|= 2H S3L P(t), H3(Q3(0)] = (0 Us aa 


(1.12) 


{The meaning of 0/0Q; was given in the paragraph following Equation 
(XII.1.29).] Let us first investigate the case that system I is in the spin state 
|+ > or |—), so that the state of the combined system is 


Ixt> =|+) @ |o>. (1.13) 


The motion of the expectation value of the momentum operator in these 
States is then given by 


d 
Ae OLE? =(0 0, 0, — 2p—¢ + |S3]+><Hol ot o> 


= (0 0, Fus<Wol a5 5 8 Io) (1.14) 


= (t) 


24 Remark on the state of a combination of physical systems: Recall that the operators in the 
direct-product space are given by Equation (III.5.7). A general statistical operator in. 4, @ #y 
is therefore not given simply by the direct product of statistical operators in #, and in #y. 
However, if either of the states W, or W, is a pure state, then Wand W, uniquely determine the 
state W of the combined system as the product W = W, ® W,,. This is the situation in the prob- 
lem under investigation. W is also given by W = W, ® W,, if the state of the combined system 
is determined by measurements that have been performed upon systems | and II separately. If, 
however, measurements are performed that measure correlated properties of systems [ and II, 
then in general W is not factorable into the form We@® Wy. 
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Thus if the state has spin up, s, = +4 (spin down, s; = —4), i.e., if the state 


is |+ > (|—)), then the time development of the expectation value of the 
momentum’s z-component fulfills 
.) 


Fin Yo 
4) 
zs Fe | Px? Wolxt) Gxtlo> (1.15+) 


d 
FPS ee 1P lye 


aH, 
0Q3(t) 


where the |x tf) are the time-dependent eigenvectors of Q in the Heisenberg 
picture. [The upper (lower) sign in (1.15 +) refers to the spin-up (spin-down) 
case. ] The right-hand side of (1.15 +) shows that the change of the momentum 
depends upon the particular properties of the magnetic field; let us assume 
for simplicity that 


ome = const > 0. (1.16) 
0x3 
Then 
d — — 
ee |Pa@ly +> = Feholvo> = +K, (1.17+) 
where 
K = Lp es (1.18) 
Ox; 


Using the initial condition that the particle has momentum 
x + (PC = Oix+>= (6,0) att =0, 


we may integrate (1.17+); the expectation value of the momentum operator 
at time ¢ is then : 


<X + IPOIX E> = ©, B, F xt). Cages.) 


If we take the expectation value between |y +) of (1.11) and use the initial 
condition <7 + |Q(t = 0)|y +> =0 together with (1.19+), then the 
expectation value of the position operator is 


(x + 1Q@la £> = <7 ©, Bt, Fxt?/2) (1.204) 


Thus the expectation value of the position operator, i.e., the position of the 
pulse of hydrogen atoms, moves on a parabolic orbit that bends downward 
(upward) in the case of a pure spin-up (spin-down) initial state, as illustrated 
in Figure 1.1. [This presumes that x > 0, i.e., presumes (1.16).] 

Let us now describe the same situation in the Schrédinger picture, where 
the state changes in time. At the time t = 0 the state is given by 


Ix + =0)) = |x +> = 14) @ Wop, (1.214) 


The Stern—Gerlach Experiment 297 


which describes the pulse of hydrogen atoms with spin up (spin down) at 
the position A. According to (XII.1.33), the time development of this state is 


Ix + > = U'D|£) ® |Wo> = eH 20554) +> ® lo) 
<e" tint aoHD| +> & [jay = |) Be MH eatOr| yy, 
(2245) 


Thus the initial pure state with spin up will move through the magnetic field 
according to 


IX+ > =|+> @e Mn e243) y > = (+> @lv +>, (1.234) 
in contrast to the different motion 
ee | Oe HO PE) = |—> Oy —@> (23-) 
of the pure state with spin down. In (1.23+) we have defined 
PEC) Rata aad tI (1.24+) 


Now the relations connecting the momentum operator P, the position 
operator Q, and the state vectors |v + (t)> of the Schrédinger picture with 
P(t), Q(t), and |v +> of the Heisenberg picture are 


P= Pe 0) a POUG), (1.25a) 
Q = QE = 0) = UNHQMUO, (1.25b) 
[ZO = OOlx + ( = 0) = U'MIx +>, (1.25¢) 


where U(t) = e” = exp(it(Hy + 2u,5;H3(Q3))). We may write the 
Schrodinger-picture analogue to (1.14) by taking the expectation value of 
(1.12) between the state vectors |y +) and shifting the time dependence 
from the observables to the state vectors: 


© + O1Paln +O) = 4 £ IPAOIz £D 


dt 
0 oH, 
=<y+t a +> = <x +| — 2,85; lx + 
<x ae Wee ey ot ep 3 30,(0 * > 
OH, 
= = + eit(nt 2upS3H3)¢ e7 tHnt 2upS3 3H) | 4 +) 
Me | 330, ° 


— 2p 7 + | ei(H#utuplsys, oe ein z eels) | y +)» 
3 


oH 
— 2g + 15a] +) <a] ett *Hots) 352 e Ht *Hat) Wo) 
3 


= Fup + vies ma + (t)). 
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(Note that [S,, H] = 0 for the Hamiltonian H of (1.5); the matrix element 
<+ |S,| +) thus has no time dependence.) If we again assume (1.16), then 


Wt OLSEN £ O> = [Px S2  + OD GIY + OD 
3 


oH 


_ 0H; _ OH; 
= + (|v + (t))> = a 


Consequently 


ue Cy + @MIPly + @ = (0 0, + Up =) = (0,0, Fx). (1.26) 
dt 0X3 


The expectation value 


d 1 
ay oh = WOO ae Sh = RO 7 


of (1.11) easily transforms to the Schrodinger-picture analogue 
d 1 
A Xt Ole EOD = 47 x FE OIP Ix = OD. (1.27) 


Equations (1.26) and (1.27) together with the initial conditions 
(x + ( = O)|Plx + (t = 0)> = (0, B, 0) 


and 


at O10 = 0) 000) 


lead to the Schrédinger-picture analogue of (1.20+): 


1 SV 
Ke (= Cy (1) Oly) M (0 pt, + 5): (1.28+) 


As we would expect, the result is the same as in the Heisenberg picture. 

The probability distribution for the position is the “expectation value” 
of the projection operator I ® |x><x|. More precisely, the probability of 
obtaining a value in the volume (x — €, x + €) in a position measurement 
is the expectation value of the operator 


Pence el) ee in (1.29) 


(x—€) 
[cf. Equation (I1.10.8)]. Thus the position probability distribution in the 
state |y + (t)> =| +> @ |W + (> is given by 


wi(x,t) = (+1 +> + OLD lh + (OD = + OLX <XIP + (DD. 
(1.30+) 
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In particular, the probability of obtaining a value in the interval (x — €, x + €) 


at time ¢ is 
Bae 


wi(x —€,x+6,t) = i d°x' Cw + (t)|x’><x'JwW + (>. (1.314) 
The probability distribution moves with time. The exact motion of w (x, ¢) 
may be obtained by solving the Schrédinger equation for <x|W + (t)>. 
From our considerations concerning the expectation value (1.28, ), however, 
we have a rough idea of how w(x, f) moves: At t = 0, w.(x —€,x + €, t) 
and w,(x, t) will be essentially zero everywhere except in a small volume 
around the point A = (0,0,0). From A the pulse moves along the lower 
parabola, given by (1.28+); at the time ts = My,/p (ie., at the time tp, at 
which the pulse has the y-coordinate yg), w(x, t) will be essentially zero 
everywhere except in a small volume around the point B, = (0, ptg/M, 
—xKt}/2M). 
Similarly, the probability distribution 


w_(x, t) = CW — @|x><xl¥ — ©) (1.30—) 
and the probability 


w(x-6x+e)= [eva — (t)|x’) <x'|W — ()) (1.31) 


ae 
of finding the system in the volume (x — €, x + €) are the ones for the state 
ly — (t)) = | —> @ |W — (t)>, which was a pure state with spin down at A. 
Then w_(x,t) and w_(x —€,x + €,f) move along the upper parabola 
(1.28 —), and will be significantly nonzero at time rf, only around the point 
B_ = (0, ptp/M, +xt3/2M). 

These facts enable us to perform a measurement of the polarization—a 
measurement of the system-I observable $;—by measuring the system-II 
observable Q. If we put a detector plate at B (at a distance yg = pt,/M 
from A), then it will record the presence of hydrogen atoms at B, = 
(0, ptg/M, —Ktz/2M) or at B_ = (0, ptg/M, +xt%2M) only if one started at 
A = (0, 0, 0) with the pure spin-up state |y + (tf = 0)> =|+)> @|Wo> or 
with the pure spin-down state |y — (t = 0)> = | —> @ |Wo>, respectively. 
Thus a measurement of Q, on system II with result z = 2, (tg) = —Ktj/2 
constitutes a measurement of S, on system I with result s, = +5, and a 
measurement of Q, on system II with result z = z_(tg) = +4 5/2 constitutes 
a measurement of S; on system I with result s; = —4. 

We now investigate the case where the system-I state is the mixture (1.9), 
where the coefficients 4 before each of the projectors |+ >< +] and |—><¢—| 
represent the equal a priori probabilities of the system being in the states 
| +> or| —)>. Att = Othe beam is described by the statistical operator (1.10) 
with W(t = 0) given by (1.9) and W(t = 0) given by (1.7): 


W(0) = W(0) @ WO) = GI+><+1 + 21-><- 1 @ (o> Hol) 
= 31+) @lWor<+1@<hol + 21-> @l¥or<-1®@ ol 
= 214+ 0)><x+O] + 2lx-OPG- Ol. (1,32) 
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The time development of W is 


W(t) = 314+) @lW + O><+1O G+ Ol 
+31/-> @l¥-M><-1@H—- Ol 
Sh ORG eer Ole ed (9), ONE (1.33) 


The probability distribution w(x, t) = Tr(J @ |x) <x|W(t)) for the position 
is then 


w(x, 1) = $<xlW + (>< + (IX + 3<xl¥ — (DOH — (OIXD (1.34) 
= 4w,(x, t) + 4w_(x, 0), (1.35) 


where w,(x, t) and w_(x, t) are the probability distributions of (1.304), 
and their (significantly) nonzero parts move downward and upward along 
the parabolic trajectories (1.28+) and (1.28—), respectively. Thus the un- 
polarized beam is split into two beams. 

This was in fact what Stern and Gerlach observed, in contradiction to the 
predictions of classical physics.* Classically an unpolarized beam contains 
atoms with magnetic moments equally distributed in all directions. The 
deviation would be proportional to the magnitude of the z-component of 
the magnetic moment, and hence one would expect the narrow beam to 
just smear out in the z-direction. 

We can now add (at least in a Gedanken experiment) a second magnet that 
reverses the splitting effect of the first magnet. The two beams are then 
brought together again; and at D at time tp = Myp/p we again have the 
same state as at A at time ¢ = 0, ie., a mixture described by (1.32). If this 
pulse is now passed through another Stern-Gerlach apparatus with the 
magnetic field in the x-direction, it will split into two beams. This is easily 
seen by representing W, as 


Wa=AN=5= 2181 =+)><s,= +271 + dl = —$><s, = —4l, 
(1.36) 
where |s,; = +4) and |s,; = —4) are the basis vectors of #; satisfying 
Sls, = + = +4315, = +; (137) 


the above arguments are then repeated with the z-component exchanged 
with the x-component. 

We will now analyze the Stern—-Gerlach experiment for the case that the 
state at A is a pure state, polarized in any direction n other than the z-direction 
(this case we have already treated), e.g., the x-direction.* At time t = 0 the 
state is then described by the statistical operator 


W(O) = 1x0)<x(0)| = 16¢41 @ IHor<Wol (1.38) 


> The original Stern-Gerlach experiment (1922) used silver atoms rather than hydrogen 
atoms. 

* Use of the Pauli spin matrices easily gives |s, = +4 > =(1/./2)(| +>+ |- >) (upto 
an arbitrary phase factor). For the Pauli spin Matrices see problem 7 of Chapter III. 
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or by the vector 


1x)> = 16> @ Woo = a1 +> @ lwo» + BI—> ®@ lwo, (1.38’) 


where | ¢> is given by (1.8). This state develops in time according to (XII.1.35); 
using the above results (1.22), 1.23), and (1.24,) one obtains 


Ix)> = a1 +> @lw + > + Bl -> @lw — >. (1.39) 


Now in a unitary time development a pure state goes into a pure state. The 
system will therefore remain in a pure state while it passes through the mag- 
netic field; there is no splitting into two subensembles. If the beam reaches 
D after having passed through both magnets, the pulse is again in the pure 
state with the same polarization that it had at A. For example, if we start at 
A with a beam polarized in the x-direction, it will again be polarized in the 
x-direction at D at the time tp = Myp/p; if it is then sent through the second 
Stern—-Gerlach apparatus with the field in the x-direction, it will not split, 
but will just bend. 

Suppose now that we perform a position measurement on this pure state at 
B, i.e., we place at B a measuring device (screen) that lets the beam pass 
through but that registers the presence of hydrogen atoms. Recall that the 
probability of finding the hydrogen atom in the volume (x — €,x + €) 
around x is given by the expectation value of the operator J @ A(x, €) of 
(1.29) with the statistical operator 


W(t) = |x) <x) = Ia? 1 +> @lW + (OD<4+1@ H+ C/O] 
+ |B?|—->@|¥ -M><-l@W-@O 
+ oB| +> @l¥ + ()<-1@<y -O| 
+ @B|-—> @|¥ -—-@M)><+1@W+ (| 
(1.40) 
corresponding to the pure state (1.39); thus 


Winy(X — €& X + €) = Tr(W(t)T @ A(x, ©) 
= |al? Tr] +><+ |) Try + (> <w + (OIAC, ©) 
+ |B? Tr(| —><— |) Trl — O><w — OIACx, ©) 
= |a|?w.(x —e,x + €,t) + |B/?w_(x —e,x 4+ «, 2), 
(1.41) 
where w,(x — €, x + €, ¢) is given by (1.31 +). The probability distribution 
for the position, i.e., the “expectation value” of the “operator” I ®@ |x><x|, 
is obtained in the same manner, and is given by 
Wie (X) = lal?wi(x, t) + |B|?w_(x, 2). (1.42) 
For the case that the hydrogen atoms were originally polarized in the x- 
direction (i.e., |a|? = 4 and | B|? = 4), 
Wi ny(X) = 2W4(x, t) + Zw_(x, 0). 
Thus the position probability distribution for the pure state (1.8) is the 
same as the probability for a position measurement in the mixture with the 
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system-I statistical operator 
Wi=laP?l+><+| + IBP1—><-I- (1.43) 


From our discussion of the motions of w, (x, t) and w_(x, t) we conclude 
that the hydrogen atoms pass through the screen at B, and at B_ with 
relative probabilities |«|? and |£|?, respectively. Now Equations (1.40), 
(1.41), and (1.42) depend upon the assumption that the system is in the pure 
state | y(t)> given by (1.39). This assumption is valid only for t < tg, for after 
the position has been measured at B the state is no longer given by | y(¢)>; 
it is changed to a new state bythe measurement process. If the screen has 
openings only around B, and B_, then, according to the basic assumption 
IIIb. [Equation (II.5.1)], the state is given at time t = tg by 


W(t) = A(Xgs, ©) X(te)> <x(te)|ACXp+ , ©) 
+ A(xp_, €)|x(te)> <x(te) | A(x, €). (1.44) 
Now 


A(xg+,©)|W + (tp)><W + (tp)| A(Xps, ©) 
(xp+ +€) 
= [7 Px x" (xd IY t DG £ DIX" 


XB+te€ 
Since <x’|y — (tg)) = 0 for x’ near xg, and since <wW — (tg)|x"> = 0 for 
x” near Xz,, it follows that 
A(xg+,€)|W — (te)><w — te)|A(xp+, €) = 0. 


Similarly 


A(xp-5 ©)|W + (ta)><W + (ta) |ACKe-, €) = 0. 


Expansion of (1.44) by means of (1.39) and use of the last three equations 
allows us to rewrite (1.44) as 


W*(ta) = |al?| +><+ | @ Alxss, OLY + (te)><W + (te) A(Xp+, ©) 
+ |Bl?| —><— | @ A(xg_, 1 — (ta) <W — (ta) |ACXp-, ©). 

(1.45) 
We thus see that the performance of a measurement changes the situation 
drastically. If the beam in the pure state (1.39) passes B without a measure- 
ment being performed, then the system remains in this pure state | y(t)>. Ifa 
measurement is performed as the beam passes B, 1.e., if it is registered that the 
hydrogen atom has passed through the opening at B, or at B_, then the 
system becomes a mixture of the electron spin-up and spin-down states. For 


example, if the openings in the screen are symmetrical with respect to the xy 
plane, 


w(Xp4 — ©, Xpy + €, tg) = w_(Xp_ — €, Xp_ + ©, tp), (1.46) 
and are big enough to let all of the particles in the beam pass through, then 


A(Xe+, 1M + (te) + (te) ACXe+ ©) = |W + (ta)> CW + (te). (1.47) 
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The state after the measurement then becomes 


W*(t) = jal?1+><+1@lW+ OW + Ol 
+ |B? 1—><-1@l¥ —-(<w — (| 
=|a?|+>@l¥+(><+1@W+ | 
+ |BP|—> @lv¥ -M><-l@wWw-Ol 
= lal lx+ O><x+ Ol + BP lx -— O<x- Ol = MD 
loft =a (1.48) 


If the state (1.48) now passes through the second magnet, it will arrive at 
D ina mixture of spin-up with spin-down states. A subsequent Stern—Gerlach 
apparatus with magnetic field in the x-direction will then split the beam. 

In summary, we have found that the pure state (1.38) is not split by the 
inhomogeneous magnetic fields of the first and second magnets unless a 
position measurement is made at the plane B. If such a measurement is 
made, however, the pure state splits at B into two subensembles of spin up and 
spin down, and consequently into two subensembles of spin right (s, = +4) 
and spin left (s; = —}), by the second Stern—Gerlach apparatus, whose 
field is in the x-direction. This concrete example illustrates the content 
of the basic assumption IIIa. 


XUI.A Appendix 


In (1.4) and (1.12) we have neglected the force upon the particle except for 
the force in the z-direction. This requires a justification, because due to the 
equations of electrodynamics 


VoH=0 Voor — 0) (A.1) 
Therefore a large 0# 3/0x, will at least require also a large 
OH, oH; 
— = —-—— A.2 
Ox, 0x3 vee 


and therewith a force in the x-direction. We shall describe in the following 
the simplest theoretical arrangement,’ in which the momentum acquired 
by the particle in the x-direction never becomes large. This can be achieved 
if one chooses a large magnetic field in the z-direction. This field causes a 
Larmor precession of the magnetic moment, as a consequence of which the 
time average of the force in the x-direction, F, is small—not the force F, itself. 

Choosing (A.2), we neglect any variation of the magnetic field with x, 
(edge effects). From (A.2) it then follows that 


H3= AX3 = P(X1), 
Hy, a AX ae W(x3), 


(A.3) 


> Based on a homework problem by J. W. Alred. 
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where #(x,) and (x3) are arbitrary functions except for the requirement 
that H fulfill (A.1). The second equation of (A.1) requires 


0H, a oH, = oH, < oH, =); (A.4) 
Ox,  GOx5 OX, OX, 
and 
OH; _ OH (A.5) 
ax, OX; 


Equation (A.4) follows from the neglect of any variation with x,. Equations 
(A.5) and (A.3) then lead to the condition 


CAs) Mess) (A.6) 


Ox, Ox; 
which is fulfilled if we choose 


P(X%1) = Ho, = W(X3) = 0. (A.7) 


Therewith we have a magnetic field which can have a strong component in 
the z-direction and is inhomogeneous in the x- and the z-direction: 


H = AX eC + (Ao = AX +)C3. (A.8) 


This is the simplest possibility compatible with the Maxwell equations; in 
an actual experimental setup the field will, of course, be more complicated. 

The torque on a magnetic moment M in a magnetic field H is M x H, 
which leads to a change in angular momentum (I[X.3.10): 


dS 
— =MxH. : 
: i 4 (A.9) 
And as for the electron, spin and magnetic moment are connected by (1.2): 


dM 


ap =~ 2lteM x H. (A.10) 


If we choose for the magnetic field (A.8) with 
Hg 2 \ AX | = lees (A.11) 
then 


dM 


= —2ptpM x Hoes. (A.12) 


This means that the magnetic moment operator M performs a precession 
around the z-axis e, with frequency 


w= 2ppH. (A.13) 
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This can be seen by differentiating (A.12) again and writing it in component 
form: 


d’M, 4 aM, da’M 
dt2 ee M,, ae = —w*M), ie = . (A.14) 
The solution of (A.14) is 
M,,2(t) = M,,.(0)e7', (A.15) 


whereas the component M, does not change in time. 
Instead of (1.4) one has now 


Hix = —M,(t)4Q, — M, Hy — M;3(4Q3), (A.16) 
and instead of (1.12) one has for the force operator 


dP(t) _ 


1 1 
ar = 7 7 PPO. Q11Mi(04 - = 0PO,O31M34— (A.17) 


The second term on the right-hand side is the same as the term in (1.12) 
except that we have here the special case 


a 


Sy 
dQ3(t) 
As a consequence the expectation value of P(t) is given by (1.19). 


The first term in (A.17) results in a component of the force operator in the 
x-direction: 


(A.8’) 


dP (t : 
WD _ Mlk = My(Oyhe™ 


which oscillates with the frequency (A.13). Though its expectation value may 
be of the same order as (1.17), it oscillates rapidly under the condition (A.11). 
And so does the expectation value of P,(t). The time average of the force as 
well as the momentum in the x-direction is zero. In an experiment, which 
always measures the average value of the expectation value over a finite 
time interval, a deviation from zero in the x-direction will not be observed. 

In the conventional Stern—Gerlach experiment® the magnetic field is not 
given by the idealization (A.8) but is usually of the form 


H = Ai,(x%, X3)e; + (Ho + H3(x1, X3))e3 


with H,(x,, x3) and H;(x,, x3) fulfilling (A.5) and (A.2) and | Ho| > | H,| ~ 
|H,|. Clearly, the same arguments as above apply, and (1.4) suffices to 
describe the observable effects. 


§ For other arrangements of a Stern Gerlach experiment see Myer Bloom and Karl Erdman, 
Can. J. Phys. 40, 179 (1962). 


CRIAPTER AIV 


Transitions in Quantum Physical 
Systems—Cross Section 


This is the fundamental chapter on scattering theory. Here the concept of 
transition probabilities is introduced and general formulas for the cross 
section are derived from the basic assumptions of quantum mechanics. In 
Section XIV.2, a general formula for the transition rate is obtained, which 
will be used in Section XIV.5 for the derivation of the cross-section formula and 
in Chapter XX] for the derivation of the decay rate. Section XIV.3 introduces 
the concept of cross section, and Section XIV.4 gives a brief description of 
the different ways to relate the cross section to fundamental physical ob- 
servables. The main section of this chapter is Section XIV.5, where the cross- 
section formulas for very general physical situations are derived. At the end 
of section XIV.5 the physical state of the scattering system is further specified 
to obtain the well-known expressions for the cross section. For a superficial 
understanding of the material of this chapter one may omit parts of Section 
XIV.2 and Section XIV.5. For this purpose we have discussed special results 
of Section XIV.5 at the end of Section XIV.4. 


XIV.1 Introduction 


In the present chapter we collect some material that will be used in subse- 
quent chapters to describe collision and decay processes. We derive the 
transition probability, introduce the notion of transition rate, and then define 
the cross section and derive the formula that relates it to the transition 
probability. 
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The general situation that we shall describe, and of which collision and 
decay processes are just two particular cases, is the change in time of a 
nonstationary state W(t). We shall use the basic assumptions of quantum 
mechanics, in particular the axiom V. We shall make the further simplifying 
assumption that the generator H of time translation can be split into two 
parts: 


H=K+YV, (1.1) 


where K is the energy operator of the isolated physical system with stationary 
states and is assumed to be well defined. This means that it is assumed to 
make sense to consider an approximate description of the physical system 
by a stationary physical system. There are many examples for which this is 
possible. Thus K may be the energy operator of an atom or of a molecule. 
In the approximate description the states belonging to an energy level are 
stationary, but in reality they are only quasistationary and decay into the 
ground state. This transition is then caused by V = H — K. Another example 
is the combination of two physical systems that can be spatially separated far 
apart from each other, a situation which one usually encounters in collision 
processes. If they are far apart, the energy operator of the combined system 
is K; and V = H — K, which describes the interaction between the two 
subsystems, has a finite range.' 

The observables that are measured in these processes are assumed to 
commute with the operator K. For example, a detector may register all the 
ground states of atoms that were initially in excited states (by registering, 
for example, all emitted light of a particular frequency). The observable 
measured is thus the projection operator on that subspace of the space of 
physical states which contains the ground states of the atoms, i.e., the ob- 
servable measured is the property of being in the atomic ground state (cf. 
Section II.4). In the other example of collision experiments the detector is 
placed far away from the target and therefore detects eigenstates or mixtures 
of eigenstates of the operator K. The property B measured by the detector is 
described by a projection operator Ag, which projects onto an (in general 
continuous) direct sum of energy eigenspaces of K. Usually B is a more 
specific property. For example, the detectors may not be placed all around 
the target, but only at a particular angle. A, is then the projection operator 
onto that particular subspace of the above direct sum of energy subspaces that 
contains states whose momentum vectors are directed into the particular 
solid angle. 

Thus the problem that we shall discuss (in the Schrodinger picture) is the 
following: The state W(t) changes in time according to 


Wt) =e HtWetitth = ((W = W(0)). (1.2) 


1 It should be remarked that the assumption (1.1) is not really necessary for the description 
of collision processes; it suffices to assume only that asymptotic direct-product states do exist. 
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What is the expectation value of an observable A (i.e., what is the probability 
of measuring the property A) for which 


PAK l= 0? (1.3) 


This problem is often interpreted as the transition between different sta- 
tionary states of a physical system caused by an interaction V. This is based 
on the assumed existence of a K eigenstate W'"(t = — 00) in the remote past 
and of a K eigenstate W°"'(t = +00) in the far future that equals or con- 
tains A. 


XIV.2 Transition Probabilities and Transition Rates 


Denote the state of the physical system by W, and the observable to be 
measured by A. Then, according to the basic assumption II, the expectation 
value <A) of A is given by 

<A> = Tr(AW). (2.1) 


Since A is a projector, <A> is the probability that the system is in the sub- 
space AH. <A) is called the transition probability into the state A. 

The transition probability is a function of time. At the beginning of the 
process <A) will in general be zero, and will then increase with time. Ac- 
cording to the results of Chapter XII, this time dependence can be described 
in various pictures. We shall use the Schrédinger picture first, because it 
appeals best to our intuitive understanding of a scattering experiment with 
a pulsed beam or of a decay process. In the Schrédinger picture the ensemble 
of physical systems is described by the time-dependent statistical operator 
Wt), which describes the beam moving towards the detector, or the excited 
State of an atom decaying into the ground state. 

The observable A, which represents the apparatus, does not change in 
time. [We assume that A does not depend explicitly upon time (@A/ét = 0) 
i.e., that the apparatus is not influenced from the outside after the scattering 
experiment has begun.] Thus the transition probability is given by 


<A>, = TKAW®)), (2:2) 
where 
Mt) = UN()WU(t) (W = W(0)); 
W is the state of the system at some conveniently chosen time, which we 
call t = 0, and U(t) = e'#"" is the time-evolution operator of Chapter XII. 
The transition rate is defined to be the time rate of change of the transition 
probability: d<A),/dt. Differentiation of (2.2) and use of (XII.1.33) give 


d 7 dW(t) 
Ma Te(A a 


Substitution of H = K + V into (2.3) yields 


= —i Tr(A[H, W(t). (2.3) 


: <A>, = —i Tr(ALV, W(t)}) — i Tr(ALK, W(t))). (2.4) 
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With the use of (1.3) the second term in (2.4) is easily shown to vanish: 
Tr(AL[K, W(t)]) = Tr(ALK, W(t)] + LK, AJW(0)) 
= Tr(AKW(t) — AW(t)K + KAW(t) — AKW(t)) 
= —Tr(AW(t)K) + Tr(KAW(2)) = 0. 
Let {|b>} be a basis for the subspace A¥% onto which A projects. Then 


Tr(AW(t)V) = 2<b| W(t)V |b> = <b| VW(t)|b>* 


= (Tr(AV W(t)))*. 
The right-hand side of (2.4) may then be written 


. <A), = —i TrAVW(t)) + i THAVW(D))*. (2.5) 
But i(a* — «) = 2 Im «; consequently ** 
h e <A>, = 2 Im Tr(AVW(t)). (2.6) 


In order to express the transition rate in a form more easily used, we use a 
basis of generalized eigenvectors of K and of H: 


Kla> as E,|a), (2.7a) 
and 
H\a*> = E,|a*). (2.7b) 


Recall that the generalized eigenvectors |a* > of H are related to the general- 
ized eigenvectors |a> of K by means of the Lippman—Schwinger equation 
[Equation (VIII.2.7’)]: 
1 
i= ——_————— V|a*). 2.8 
|a*> Cree) |a*> (2.8) 
On occasion we shall make the labeling on |a> and |a*) more explicit by 
writing 
(a)|a> =|E,4> and (b) |a*> = |E,4*), 


where dG = (a;,a5,...,4,) are labels needed in addition to the energy to 
specify the generalized vectors. These labels (quantum numbers) are eigen- 
values of additional operators A,, A2,..., A,, which together with K form 
a c.s.c.o. For example, d may consist of the angular-momentum quantum 
numbers /| and |, together with other, internal quantum numbers n; or d may 
consist of the direction p/|p| of the momentum p together with other quantum 
numbers 7. If the operators A; (i = 1, 2,..., k) all commute with H, then the 
|E,a*> are generalized eigenvectors of the c.s.c.o. {H, Aj,..., A,}; other- 
wise the additional labels a just serve to indicate that |E,a*» is obtained 


12 We again use the units with A = | and introduce only in a few important formulas the 
correct factors of h. 
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from |E,a@> by way of the Lippman-Schwinger equation, and do not indicate 
that |E,a*> is an eigenvector ofall the A,’s (for more details cf. Section XV.1). 
We shall assume that the K eigenvectors |a> are normalized according to 
<ala’> = (E,4|E,a’> = ee is "Sea O(E, = E,:). (2.9a) 


[p(E,) is a weight function that is arbitrary but fixed; a convenient choice 
for p(E,) will be made later.] With such a normalization the summation 
)., is really an abbreviation for the more explicit 


L= | p(E,) dE, DA (2.10) 


The generalized eigenvectors |a* > of H have the same normalization 
<a* |a’*> = <E,a* |Eaa*> 3 P(E.) "Saar O(E, ne Ey) (2.9b) 


as the corresponding generalized eigenvectors |a> of K. The derivation of 
(2.9b) from (2.9a) is given in Appendix A of Chapter XV. 

In general the spectrum of K and the spectrum of H are not the same. 
Usually K has only a continuous spectrum {E,} starting at a particular 
value, say E = 0, and going to infinity. But the spectrum {E,} of H is the 
combination of a continuous spectrum {E,}, which is usually the same as 
the continuous spectrum of K, and a discrete spectrum {E,}, which may be 
negative but is bounded from below. Physically the continuous spectrum 
corresponds to the scattering states and the discrete spectrum to the bound 
states of the projectile and target. There may be discrete eigenvalues in the 
continuous spectrum, but for physical reasons the energy eigenvalues can- 
not be arbitrary large—in particular, not arbitrarily large and negative. We 
may thus choose the generalized eigenvectors of K: 


{a> = |E,a>} 
as a basis for the space of physical states, or we may choose the set 


{la} = {la*> = |E,4*)} v {|a,>} 


of discrete eigenvectors |«,> and generalized eigenvectors |a*> of H as a 
basis. The completeness property of these bases may be expressed as 


I= Yi la><al = foe. dE, )||E,4><E,4| (2.1 1a) 


and as 
T= Yla><a| = [oe dE, ¥|Egd*><Ega*| + )' |a,><oq|. (2.11b) 


The trace that appears on the right-hand side of (2.6) may now be written 
Tr (AVW(t)) = 9 <b| VW(t)|b> 
b 


= LL lV lay <al WO Ld». 
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Use of the Schrédinger-picture time development (1.2) of W(t) and insertion 
of another complete set of H eigenvectors |«’> then gives 


Tr(AV W(t)) = S Sy <b|V a> <ale~ Wet tl ay <a’ |b» 
b aa’ 
=) 2d, <bIV a> <a] Wa’) <a'|b>e~ Ma Be' (2,12) 
b aa’ 


Suppose H does have discrete eigenvalues, corresponding to bound states 
|x,>. Since bound states cannot evolve from free states, we assume that the 
State W contains no contributions from bound states: mathematically this 
assumption is expressed by 


CALUACe as (2.13) 
We may therefore restrict the summations over « and a’ to the continuous 
spectrum of H. 


Since [A, K] = 0, the basis {|b>} of the subspace AH may be chosen to 
consist of generalized eigenvectors of K: 


K|b>=E,|b> (|b) = |E,, 5»). (2.14) 


If we take the inner product of the Lippman-Schwinger equation (2.8) 
with |b>, we obtain with (2.14) 


1 
tt = , ‘ rt bh 
<a’* |b)» en <a VE Ky” 
1 
= {a'|b> + li a 215 
<a’'|b> be <A | Le a ae? (2.154) 


Use of (2.15)? in (2.12) yields 


+ + 
TrAVW(t) =¥ yen SAAD |Wla’t> +11, 
b aa’ Ey = E, — i0 
(2.16) 
where 
ee Cb V la ><a lb a |W|a'* >. (2.17) 
b aa’ 
In Appendix XV.A we shall show that the second term IJ vanishes; we con- 
tent ourselves here with an intuitive argument: The factor <a‘|b> in the 
second term does not describe a transition, but describes the probability of 
observing the configuration of the initial state. But in a scattering experiment 
(except for absorption measurements, which do not measure <A> directly), 
the detectors are placed at an angle to the direction of the incident beam so 


? That it is more natural to choose ¢ - 0+ rather than ¢ — 0— can be seen from Section 
XV.3, where we discuss in detail the significance of the choice « — 0+. However the derivation 
could have as well been continued with (2.15_) and would lead to the same results. 
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that they are not flooded by the incident beam. Thus | E, A) ¢€A# for all 
eigenvectors |E, A) that give a nonzero contribution to the initial state, and 
the term II does not contribute to the probability of observing A.* 

The transition rate is then obtained by inserting the first term of (2.16) 
into (2.5). By using the Hermiticity properties 


<a*|Wla'*>* = <a’*|Wla*> and <b|V\a*>* = <a*|V|b> 


of W and V, and by exchanging the summation indices a and a’ in the second 
term arising from (2.5), one obtains 


4 (Ad, = ~EY Le tee Kb|V a) <al* |V 1b) <a* |Wla’*> 


b aa’ 


1 1 
: (= pest Ek awe 3) (2.18) 


We will make use of this last result both in Section XIV.5 when we calculate 
cross section and in Section XX when we calculate the decay rate. 


XIV.3 Cross Sections 


In a scattering experiment a beam is directed towards a small target. A 
detector is located at a large distance from the target and at a (nonzero) angle 
Q = (8, d) with respect to the incident beam. 

In classical physics this beam can be a beam of particles or a beam of 
radiation. For the case of particle scattering, for example, a beam of Nz 
particles of type B with mass m, and velocity v, is, during an interval of time 
At, sent towards a target consisting of N; particles of mass m;. The number of 
particles per second that are scattered and reach the detector, N/At—the 
counting rate—is proportional to both the number N; of particles in the 
target and to the incoming flux (1.e., to the number of particles per second 
per unit area perpendicular to vy). The incoming flux is given by 


incident flux = pgVo, Gab 


where pz, is the density of particles in the beam. Thus the counting rate 
N/At is 


N/At = ONrPpYo, (32) 


where the proportionality constant o is called the cross section. The cross 
section for the scattering of particles by particles is thus defined by 


_ _NINg 
 PpVo At’ 


G3) 


* Note that IT is the term that comes from the first term in the Lippman- Schwinger Equation. 
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Pa, Yo 


p= 
target 


Figure 3.1 Schematic diagram of a scattering experiment. 


It has the dimensions of an area. [If the target is not at rest, then the relative 
speed v = |V9 — v;| should be used in place of vp in (3.2).] 

Different kinds of cross section are given different names. If the detector 
detects all particles of a third kind that leave the target area in all directions, 
then o is called the production cross section. If the detector detects particles 
of type B, the incident type, after they leave the target area, then c is called 
the scattering cross section. In particular o is called the total scattering cross 
section if the detector detects all particles B leaving the target area without 
regard to the direction in which the particles leave. The differential cross 
section do is obtained if only those dN particles which are directed into a 
specific cone of (infinitesimal) solid angle dQ = sin @d@d@ are detected 
cf, Figure 3.1): 


dN/At do dN 1/At 
OF = 


ag = ee 
i PaNrvo dQ dQ pgNrvro 


(3.4) 


The other example of a scattering experiment in classical physics is the 
scattering of a beam of radiation incident on an obstacle (target). The 
detector, which is located outside the incident beam at a distance large 
compared with the wavelength of the radiation and with the size of the ob- 
stacle, measures the flux scattered in a given direction. The ratio of the 
scattered power dI,,,, to the incident power per unit area (flux) J;,cig is the 
differential cross section 


do = Asean (35) 


Hescsa 


In quantum physics the beam is a beam of quantum physical systems, and 
the measured quantities are probabilities. The detector measures the proba- 
bility of a transition from the incident state. This transition probability is 
proportional to the probability that a particle before scattering is passing 
through a surface of unit area perpendicular to the incident velocity. The 
proportionality constant is the quaantum-mechanical cross section: 


transition probability 


== os ——. (3.6) 
incident probability per unit area 


As in the classical case, there are different kinds of cross section. If the 
detector registers all states that emerge from the target, then o is the total 
cross section. If the detector registers only a subset of states (e.g., only those 
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states which are in a state B), then o = og, is called the partial cross section. 
Thus 


transition probability from a state A into a state B 
incident probability per unit area ; 


OBA = (3.7) 
Suppose B is the property that the system’s momentum vector is directed 
into a specific cone of solid angle dQ around the direction Q = (6, ); then 


(transition probability from a state A into 


1 all states with momentum vector directed into dQ) (3.8) 
Soa = : : 
a incident probability per unit area 


or more often dog,/dQ is called the differential cross section. 

In typical scattering experiments the projectile is chosen to be as structure- 
less as possible (e.g., electrons), whereas the target consists of more com- 
plicated objects (e.g., atoms). The experimental cross sections then reveal 
information about the structure of the target particles. The collision of the 
projectile with the target can be elastic, as is the case when the energy of the 
projectile is smaller than the difference between the internal energy levels 
of the target; or the collision can be inelastic, as is the case when the internal 
energy level of the target is changed by the collision. To fix the nomen- 
clature we shall use “total collision cross section” for the cross section o 
of a collision that involves any elastic or inelastic process. The cross section 
Oelas for all elastic collisions is called the total elastic cross section while the 
cross section @, for inelastic collisions into the nth internal energy level will be 
called the total inelastic partial cross section for the yth level. The total 
inelastic cross section is )', ¢,. The total cross section o is then the sum of 
the total elastic cross section with the total inelastic cross sections: 


C—O Oy. (3.9) 
” 


Corresponding notation is used for the differential cross section. 


XIV.4 The Relation of Cross Sections to the Fundamental 
Physical Observables 


Cross sections are the observable quantities. Their measurement follows in 
principle from their definition, given in the previous section, in terms of 
counting rates and flux.* The cross sections are related to the fundamental 
observables which describe the structure of the physical system consisting 
of the combination of projectile and target. We shall now establish the con- 
nection of the cross sections to these quantum-mechanical observables. 

If the theory of the physical system (i.e., the mathematical structure that 
describes the system) is known, then one can calculate the cross sections in 


* The actual measurement of cross sections is described in books on experimental physics. 
See, for example, Massey et al. (1969, Vol. 1). 
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terms of the known fundamental observables and predict the outcome of a 
collision experiment. More often a physicist meets the reverse situation: 
The cross sections have been measured experimentally; from this informa- 
tion conjectures are made as to what the fundamental observables are, how 
they behave, and what mathematical structure best describes such behavior. 
This task is least complicated if one of the colliding subsystems (called the 
projectile) has a known structure that is as simple as possible. Such a pro- 
jectile serves to test the structure of the target system and to provide informa- 
tion for a theoretical model of the target system. Some examples are electrons 
as projectiles with atoms or molecules as targets, protons as projectiles with 
nucleons as targets, and electrons or other leptons as projectiles with hadrons 
as targets. 

To relate the cross sections to fundamental observables one may proceed 
in two different ways. The first and conventional way is based upon the time- 
development axiom V of Chapter XII. It is assumed that there exists an energy 
operator H in the system’s algebra of operators that develops the system in a 
continuous fashion. It is usually further assumed that the energy operator 
H can be split into parts, 


H=K+V, K=K;@1I+1®Krz, (4.1) 


where K is the energy operator of the combination of the physical systems 
in the absence of the interaction V between them, K, is the energy operator 
for the beam system, and K, is the energy operator for the target system. 
K is thus the energy operator when the projectile and target are far apart.> 

The existence of a continuous unitary time development (1.2), generated 
by an operator H according to the basic assumption V, is a rather question- 
able assumption in particle physics, where a fundamental length is believed 
to exist. Therefore a second way was suggested by Heisenberg (1943). It is 
based on the realization that W(t), which describes the systems while they 
are interacting, is really not a measurable quantity. Measurable quantities 
are the initial state W(t ~ — oo) that describes the systems before the 
interaction when they are prepared for the collision process, and the final 
state W(t > +00) that describes the systems after the interaction when 
they are detected. Each initial state is transformed into a final state by the 
interaction. The operator that describes this transformation 


wit) = we're) = s w(t) cq a 
or, for pure states, 
ir(t) > Por") = SP" (4), 


is called the S-operator (“S-matrix”). 


5 The assumption (4.1) that H can be split is not really essential. One can derive expressions 
for the cross sections from the assumption of the existence of a trme-development generator H 
alone. See, for example, Goldberger and Watson (1964, Chapter 5). 
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S is to have the following properties: (1) It transforms superpositions 
a®'" + bP!" into superpositions a + b'¥°"'; therefore it must be a linear 
operator. (2) It transforms every normalized initial state uniquely into a 
normalized final state (“conservation of probability”); this, together with 
the assumption that the set of initial states as well as the set of final states 
spans the space of physical states, requires that S be unitary: SSt = S'S = 1. 

The cross section can then be related to the matrix elements of the op- 
erator S (S-matrix). Thus in this second approach the existence of an unitary 
S-operator is taken as the fundamental postulate in place of the basic as- 
sumption V of a continuous unitary time development. 

If the Hamiltonian H exists and is the generator of a continuous time 
development, then the S-matrix may be expressed in terms of H.° The S- 
operator is still a meaningful concept, however, even if this is not the case. 
Heisenberg’s assumption that it is the S-operator (and not the Hamiltonian) 
that is the fundamental physical quantity has become the basis for an attempt 
at a new formulation of elementary-particle theory known as S-matrix 
theory. 

The first way thus assumes more and allows one to relate the observable 
quantities, such as cross sections, to fundamental observables that reveal 
more of the physical system’s structure. The second way denies the pos- 
sibility of this deeper insight into the structure of the physical system. For 
atomic scattering experiments in nonrelativistic quantum mechanics, the 
existence of an elementary length of less than 107 !* cm (=10~° A) can be 
ignored. We can therefore connect the cross section to our basic assumptions 
for quantum mechanics (in particular to the axiom V) by proceeding in the 
first way, along which we shall meet the S operator as a derived quantity. 

In the next chapter we shall examine in more detail the concept of in- 
coming and outgoing states and their relationship to the S-matrix. 

In Section XIV.5 we shall give a derivation of the cross section from the 
basic assumptions of quantum mechanics under very general conditions. 
Here we list a few special results for the benefit of the reader who does not want 
to go through the tedious derivations of the following section. 

The cross section is expressed in terms of the matrix elements of the 
interaction Hamiltonian V or the T-matrix. The matrix elements of V and 
T are related by 


CN, QE|T| EQ 4> = <n, QE|V | EQ'n; > (5.41’) 


Here E = p?/2m + E, denotes the energy of the target-projectile system, 
E,, denotes the internal energy (which may be a function of the internal 
quantum numbers 7), and Q = (6) denotes the direction of the projectile 
momentum, p = pQ. |EQn*)> is the eigenvector of the exact operator H, 
and | EQn)> is the eigenvector of the free-energy operator K = H — V. Ifa 
Hamiltonian time development does not exist, then the T-matrix is con- 


° In this context the S-matrix was first introduced by J. A. Wheeler in 1937. 
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nected with the S-matrix by 
(NQE|S|E'O'n'> = (nQE| E'Q'n'>) — 2ni6(E — E)KnQE|T | EQ'y’). 
(XV.3.36’) 
If the generalized eigenvectors are normalized according to 
(EQn| EQN’) = Cpn|p'n'> = by OP — P’), 
then the differential cross section for the scattering of a projectile with mass m, 
and momentum p, on a target in a state 7, into a particle with mass m, 


going into the direction 2, = p,/p, and leaving the target in the new state 
Np, is given by 


(22)*h?m, Mp P(E 4) 


Pa 


do 
dQ (Q,E 4, — Pana) = IC E4Q,n5| T|pana>l?, Oriel) 


where 


2 
P(E.) = /2m(E,— Ey),  E,= 24 + Eny. 
2m, 
For elastic scattering, m4 = m,,4 = Np, this goes into (5.62) at the end of 
Section XIV.5, which can be expressed in terms of the elastic-scattering 


amplitude (5.63) given by (5.64). 


XIV.5°° Derivation of Cross-Section Formulas for the Scattering 
of a Beam off a Fixed Target 


We shall now give the precise expression for the quantum-mechanical de- 
finition of the cross section in terms of the state and the transition probability. 
We shall then proceed to derive various cross-section formulas. 

The transition probability per unit time, corresponding to d(N/N7N,3)/dt 
for classical particle scattering, is in quantum mechanics given by d<A),/dt. 
The probability that the system will make a transition during the time period 
At taken by the experiment is (d<A>,/dt) At. 

The probability density for the beam position, corresponding to p;/Ng 
for classical particle scattering, is given in quantum mechanics by 
<x|Wir(t)|x>, where Wi?(t) is the statistical operator describing the state 
of the incident beam system. If we assume a small spread in the velocities 
of the beam’s constituent particles, the incident probability per unit area 
per unit time for a beam that is incident in the z-direction with an average 
velocity v9 is <x| W¥#(t)|x>v0. 

According to its definition (3.7) and corresponding to its classical ex- 
pression (3.3), the cross section for an experiment that runs over the period 


6° To follow these derivations may require special effort. We have therefore presented a few 
special results needed for the following chapters at the end of section XIV.4 and this section 
may be omitted in first reading. 
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At is then given as the proportionality factor o in the equation 


dKA> 


ii At = 0<x|W(t)|x>v At. (5.1) 


If the experiment extends over a long time period, the defining equation (5.1) 
goes over into the equation 


[ ees “le dt o{x|W#(t)|x>v9. (52) 


If the beam does not consist of structureless quantum physical systems 
but has internal degrees of freedom described by a set of quantum numbers 
A, then the beam’s position probability density is ys , <xA| Wit(t)|xA>, where 
|xA> is a basis of generalized position eigenvectors for the space of physical 
states of the beam system. The quantum numbers / usually describe the 
polarization (spin state), and we shall therefore generally refer to A as the 
polarization, even though in the case of more complicated projectiles (e.g., 
atoms, molecules, or ions), A could stand for a whole set of quantum numbers 
upon which the energy eigenvalues of the beam system may depend. 

If there are ae then one has 


— 


dt =| dt ovo » <xA| Wir(t)|xA> (5.3) 


mail? Of 
instead of (5.2). 
We assume in this section that the target is fixed in position; v, is thus the 
average relative speed between projectile (beam) and target. We further 
assume that in the absence of the interaction V between the component 
systems, the statistical operator W'"(t) for the combined projectile-target 
system would be factorable into the product 


win(t) = Wat) ® Writ) (5.4) 


of a statistical operator W¥(t) describing the state of the projectile with a 
statistical operator W'f(t) describing the state of the target. Wi?(r) acts in a 
space of states #,, Wi"(t) acts in a space %,, and W'"(r) acts in the total 
space # = #,@ #,. The justification for these assumptions lies in the 
initial conditions. 

w(t) can be thought of as the operator which describes the system before 
the interaction becomes effective (i.e., for t > — oo), for it is then that the 
beam and target system are prepared. Developing in time according to 


wit) =e iKt pin pikt 


where K is the interaction free-energy operator, it describes a fictitious state 
in which the beam passes through the target without interaction. The pre- 
paration of the component systems is by way of measurements of properties 
which are uncorrelated at the time of preparation, with the consequence 
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that at this time W'"(z) is factorable into the form (5.4).’ But then it remains 
so at all times, as 
w(t) a e7 (Kat Kryt(pyin ® Wit)e+ (Ka + Kryt 
e” ‘Kat yyingiKat ® ep trying Est 
= Wat) ® Wr). (5.5a) 
The actual state of the combined system is W(t), which, while it is es- 


sentially the same as W'"(t) in the distant past, has a different time develop- 
ment [Equation (1.2)}: 


Wier Wet. (5.5b) 


Unlike W(t), W(t) is generally not factorable, because the interaction part 
V of its time-development generator H = K + V acts nontrivially in both 
H,and Hy. 

Rather than interrupt the calculations to follow with many explanations, 
we shail first make some remark about the basis vectors which will be used 
and their normalizations, and shall then list some needed mathematical 
relations. 

For the basis {|Ed>} of # = #, @ #, we shall use the direct-product 
basis consisting of the generalized K eigenvectors 


|Ea> = |pa> @ |Ern). (5.6) 


Here | p> is a generalized eigenvector with polarization A of the momentum 
operator P for the beam system, and | E;-n> is a (possibly generalized) eigen- 
vector with polarization 7 of the operator K; = K'?' for the internal energy 
of the target system.® The generalized eigenvectors |p> have the usual 
6-function normalization, 


<pAlp'A’> = 0°(p — P')daz Cy) 
while the |E;>’s may be either proper eigenvectors | E4-n> corresponding 
to a discrete eigenvalue E+ and having normalization 

CETN|ETN'> = Spard Onn’, (5.8a) 
or generalized eigenvectors |Ejn)> corresponding to the continuous eigen- 
value E> and having normalization 

CEGn|Epn'> = PET) *OCET — EF')Ogy (5.8b) 

Since the |pA>’s are also the generalized eigenvectors of the beam system’s 
energy operator 

K, = Kg(P) + K5'(A), (5.9) 


7 Cf. remarks in Section XIII.1 concerning the state of a combination of physical systems. 

8 Though K’?"' is the operator for the internal energy of the target (1.c., has no kinetic-energy 
term) we shall not exclude the possibility that it also has a continuous spectrum, as may occur, 
for example, in collision-induced scattering. 
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the product vectors |pA> © |E;n> are in fact the generalized eigenvectors 
of the energy operator K = K, + Kz. In (5.9) K§'"(P) is the kinetic energy 
operator for the projectile, the functional form of which depends on the 
nature of the projectile; for a massive nonrelativistic projectile 


RC eS is (5.10a) 
while for a photon 
TSH RO Yo aries (5.10b) 


where c denotes the speed of light. Ki"(7°?) is the operator for the internal 

energy of the projectile. The eigenvalues of K on the basis vector (5.6) are 

therefore E = E, + E;, where 
Pp int 


pe ph) Re) omens pe (5.11) 
2m m 


for a nonrelativistic projectile with internal energy levels E¥''(A), a structure- 
less? nonrelativistic projectile, or a photon, respectively. 

For the sake of definiteness we shall assume that the projectile is non- 
relativistic and has no internal energy levels, so Ez; = p?/2m. For the other 
cases in (5.11) or for the massive relativistic case Ex = (p?c? + m?c*)!/, 
minor modifications will be necessary. We shall also use the abbreviated 
notation 


= { p(ES) dES (5.12a) 
tre faye 
and 
SCE ne — One on (5.12b) 


for Equations (5.8a) and (5.8b). We may then express the product basis 
vectors |pA> © |E;»> in terms of the total energy E and the spherical 
coordinates of p: 


| Ea> = |EE7QAn)> = |pa> @ |Ern> 


= |EgQA) @ |Ern), (5.13) 
where 
p? 
E=E,+E;= a + E;, (5.14a) 
p = pQ = p(sin 6 cos @, sin 6 sin ¢, cos 8), (5.14b) 


°’ More precisely, a projectile with only one energy level, which still may have different 
polarization states. 
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and 


Q = (6, ?). (5.14c) 


Since 


», [eceae = ) |@p= ¥ |p? sin@dp dod 


Eran Eran 
= ys 2m(E — E;,) sin 6 dé dd dE, (5.15) 
Eran 


the basis vectors (5.13) must have normalization 
(EG| E’@’) = (EE, QAn| E'EZQ'1'n'y 
1 
O(E — E')bg,67.67(Q — Q')6,-6 9 (5.16 
ME = E,) ( ) ErE ( ) Ad “nn ( ) 


where 
67(Q — Q’) = d(cos 6 — cos 6')5(p — ¢’) (5.17) 


in terms of the quantum numbers E, E,;, Q, A, and n, ie., the weight function 
(measure) p,(E) of (2.9) is fixed by our normalization choices (5.7) and 
(5.8) to be 


Pr (E) = p(E — Ey) = m\/2m(E — Ez) = mp, (5.18a) 
and 6,,: of (2.9) is 
OEE O(Q — Q'\6 43:59 (5.18b) 


The mathematical relations that we need in the calculations are the 
identities!° 


+o 
| dt e~** = 276(s) (5.19) 


and 
a 27i0(s) (5.20) 
= 0 : 
for the generalized function 6(s). 
We shall also need the relationship 


<a*|W(t)|a’*> = Kal Wi(t)|a’), (5.21) 


which has an immediate derivation (Appendix XV.A) from the develop- 
ments of Section XV.3 below. Equation (5.21) is plausible for the following 
reason: W(t) is identical with W'"(t) in the distant past before the interaction 
V becomes effective, and |a*» is the same as |a> in the absence of V, so 
(5.21) holds at some time in the distant past. But the time developments of 


10 Gel’fand and Shilov (Vol. 1, pp. 168, 94). 
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W(t) and W'"(t) are generated by H and by K, respectively, which have the 
same effect when applied to |a*> and |a), respectively; consequently, (5.21) 
holds for all times t, not just in the distant past. 

We now start the derivation of explicit formulas for the cross section."' 
Inserting (2.20) into (5.3), one obtains 


+0 
“IST fate t-® <b] VJa*><al*|V1b) <a* |Wla'*> 
b aa’ —="00 


1 1 
x ee 
E,—E,—-—i0 £E,—£,+ 0 
Spite e) , 
= I dt vo Y <xA| WH(t)| x). 
00 A 


The right-hand side may be rewritten by inserting the basis vectors |pA>, 
using the three-dimensional version 


<xA|pa’> = (272) 3/2e!P"*6,,. (624) 
of (II.8.57), the time development (5.5b) of W3(t), and the identity (5.19): 


+ 00 
RHS of (5.22) = i | dt 195 ¥. far d3p’ <xA| pay 
— 0 rl 


x <pale*8Wie**2|p'A> <p'a|xd> 

= i “ak oe) fap d3p' (272) ~3e(P-P')xe- EB— Exp 
x <pA| Wi |p’A> 

= 2y4090 Y, [ad'p ap’ el? *5Ey — Es) 
x (pal We pA >d,4 

= (2n) 090) I p(E,) dEp p(Ex) dE’ dQ dQ! 


x el PSE, — Eg) XE,QA|Wi8|E,Q2'96,,, (5.24) 
where 


p = pQ = ./2mE,Q and p’ = pQD = ./2mE,&. 


The calculation of the left-hand side of (5.22) goes as follows: We use (5.19) 
to do the t-integration, (5.20) to replace the factor in parentheses, and (5.21) 
to express things in terms of the state W'"(t), which may be broken up by 


'! The result of this tedious calculation is Equation (5.38), 
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use of (5.4). The basis system {|a>} used in the calculation is that consisting 
of the generalized eigenvectors | EE;QAn> = |EgQA) @ |Epny with E = 1a 
So 


LHS of 


(5.22) = p? » 2n(E, — E,)<b|V|a*><a't |V|b> 
x (a|W" |a'> 2nid(E, — E,) 
7)" > x p(E — E7) dE p(E' — E’,) dE’ dQ dQ’ 
on 
x O(E — E')O(E — E,)<b| V| EE;QAn* > (E'E7Q'1'n'* |V b> 
x €EE,;QAn| W'"| E'EZQ')'1'> 
= (2x)? > >) | p(Es) dEg p(Eg) dEy dQ dQ’ 
” ein 
x (OE, + Ey — Ey — E7)O(Eg + Ey — E,)<b|V|EE-QAn*> 
x CBEZOQ'n'* |V (b> <(EgQA| We | EVQ’) (Epn| Wel ETA> 
(5,25) 
where 


E=E,+ EE, and E'=E,+ Er-. 
From (5.24) and (5.25) we then have 
0 = (2z)*(RHS of (5.22) — LHS of (5.22)) 


=o), [ren dEg p(Eg) dE) dQ dQ’ (E,QA| We | EQ’) 
Ax 


x |r00K(Es — E,)e-”)*6,,. 


— (2z)* > O(E, + Ey — Ex — E7)0(Eg + Ey — E,) 
be 
x <b| V| EE; QAy* > CEE; Q'n"* |V b> <Ern| welemn>} (5.26) 
We shall continue the calculation under the additional assumption that 
the beam is completely unpolarized, 


1 
CEpQA| We | E_Q1") = 7 in K EgQ| We llEp2>, (5.27) 


where g is the number of different polarization states 4. Were the beam com- 
pletely polarized with some definite value A) for 4, one would have to use 


CEpQA| Wi ERQ’Y = 633:6,,,<EgQ\l Wel EpQ> (5.28) 
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instead of (5.27) and would have to replace )°, 1/g by )°, 6,,, in the calcula- 
tions below. 

In the typical scattering experiment, the incident beam is prepared so 
that it has a well-defined (momentum) direction Q,. This condition is 
expressed by’? 


Jao dQ! (E,Q|| WH EpQ’) FQ, Q') = CEg|||WFIILEs>F(Qo, Qo) (5.29a) 
for any smooth function F(Q, 9’). The doubly reduced matrix element 


CEgll|WelllEs> = [a0 dQ! E,Q|| We EQ 


= fac (E,Q\| Wit ||E,Q) (5.29b) 


of the state W¥} with well-defined direction 9, describes the energy distribu- 
tion in the beam, which will be discussed in more detail below. 
The normalization of W' requires that 


1 =, | p(s) dBy dQ <E,0A|W3| E_2> 
A 
z { p(Ep) dE, dQ<E,Q\| W's|| EQ) 


= | o(Es) dé Epll WILE). (5.29¢) 


For an unpolarized beam (5.27) with well-defined direction (5.29a), Equa- 
tion (5.26) becomes 


. { p(Ep) dEs p(Es) dE CEallW PILES) 2005 — Ey)eit?-P')M0°x 


_ Qn)* » y > O(E, + Er — Es — E7)S(Ep + E, — E,) 


Ern 
Ern’ 


x <b|V{EE7Qo An” > CEE Qo An’* |V {b> <Epn|W? | Est. 


'? One can convince oneself that (5.29a) is the continuous-spectrum analogue of (5.28) by 
calculating for an arbitrary F,, 


iy «: ae A\Wr'l- ood NF xa = bs Oay KG 4 | Will Z DF yy: = ¢ ‘ || Wei: Pl anig: 
Ax Ax 
Intuitively one may want to write as the direct analogue of (5.28) 
(E_Q|| Wl EpQ’> = 5Q — QQ — QE alll WIE), 


which, however, would be mathematically incorrect. Equation (5.29a) is an idealization: ac- 
cording to the discussions in Section II.10 every beam must have a finite spread in momentum. 
Cf. also the discussion following (5.35) below. 
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As p = ,/2mEg, the exponential under the integral contributes only unity 
because of the 6(E, — E’;). Hence 


= foe.) dEg p(Ex) dEs <Egl|iW5lllE> {v9 0(Eg — Ep) 


Qn 
iG Sy ae (Ey + Ey — Ey — E7)6(Eg + Ey — E;) 
‘TN 
Ern’ 


x €b|V| EE; QoAn* > EET Qo An’ * |V\b><Epn| WHE pn}. (5.30) 


Recall that we have left open the possibility that the target has continuous 
as well as discrete energy levels E;; although we have written (5.30) as if the 
E; were discrete, it is also valid in the case (5.12) where there is a continuous 
part to the spectrum of Kr. In most experiments (although not for processes 
like collision-induced scattering) the state of the target system is in a mixture 
of discrete energy eigenstates (stationary states) before interacting with the 
beam: 


[K, WP] = (Kr, Wi] = 0. (5.31) 
Thus 
ae <n||Wr(ED)In’>|Ern> <Efn | 3) 
or 
CEpn|WrlETn'> = dpgeg<n|| Wr(Es) IND. (5.32) 
Then (5.30) becomes 


=| p(Es) dEx p(E) dE CElllWillE> 


On 


d dX Y) (Ep — Ep)o(Ep + Et — Ey) 


Efnn’ . 


x |ro0(Ey — E,) - 
x <b| V|EELQq4n* ) (E'E4.Q, An’ * |V 1b <n || WES vin 
7 | p(Eg)dE go(E5)<Egll|W'sllEp) 
4 
x Joyo _ = wy 2 Es + Ej — E,) 
x <b| V|EE4Qq9A*) CEELQ, An'* | Vb) <n|| Wit EB nin yh (5.33) 


where E = Ey + Et. and E’ = E, + E%. 
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By taking the expectation value of the beam energy Kz, in the beam state 
W® of (5.27) and (5.29), 


THK pW) = ¥, | p(Ep) dEg d0 (Eg1] WK |EsQ) 
a 
= | ols) dE dO E46EsQ|W31E 2 


= { p(E) dE» E,CEql|| WSlllEp). (5.34) 


we may identify 
F(Eg — Ego) = p(Es)<EalllW5lllEa> (5.35) 


as the probability density for obtaining the value Eg when Kg, is measured. 
In writing (5.35) we have used a notation suitable for the assumption that, 
as is usually the case, the experimental setup is such as to produce a beam 
whose energy is peaked around some value Ego under the control of the 
experimenter. Then F(E, — Ego) is one of the functions of Section II.10 
that in the unphysical limit case of sharp beam energy will go into 
6(E, — Ego). However, in the general case, the right-hand side of (5.35) 
could be any well-behaved function of Eg. If the resolution of the apparatus 
is good enough, i.e., if the beam is emitted with a small energy spread 
Epo — AEg < Eg < Ego + AEg, then F(E, — Ego) will tend to act like 
O(E, — Ego). More Piene, F(E, — Ego) will act like 0(Eg — Ego) in an 
integral 


| dEy F(Ep — Epo)g(Es) 


if the function g(Eg) varies slowly over the range Ego — AE, < Ez < 
Ego + AEg. In (5.33) this will be the case if the matrix elements 
<b| V| EEF Qo An>, where E = E, + E4, vary slowly as functions of Ez. 

This is not always true. Near some values of Ego the matrix elements may 
change very quickly; such a value Ego is called a resonance. 

We shall discuss the effect of the finite resolution below and continue the 
calculation here under the assumption that the beam is ideally mono- 
chromatic, i.e., that 


F(Eg — Ego) = p(Eg)<Eg\l|Wel\|Eg> > (Eg — Ego). (5.36) 
In this case the Eg integration in (5.33) can be performed to get 


y 
0= plEm) {ooo =< = 


> » (Epo + Et — E,)<b|V|EETQAp* > 


A EGnn’ 


x CEETQ 4 An!* |V [b> <n WEED IInd}, (5.37) 
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where E = Egy + Ef. Solving for o and reverting to rectangular coordinates 


Po = \/ 2mE go Qo, we at long last have our basic cross-section formula: 
ae 
o(A > Po) = ae i, <b|V |PoAEGN* > (Po AEtn'* | Vb) 
x hay »6(Eno + Et — E;), (5.38) 
where Exo = po/2m, vo = Po/m, and where we have introduced the notation 
= 1 
are) 


for the averaging over the polarizations in the initial state. We have also 
restored the h’s in order to express this important result in the usual units. 

Equation (5.38) is the cross section for the scattering of an unpolarized 
beam with a well-defined momentum py off a fixed target in a mixture Wi? 
of discrete energy eigenstates and into any final configuration whose quan- 
tum numbers b = (E,, 5) appear in the summation )». [To get the differen- 
tial or partial cross section do(b < po) for scattering into the “state” A, = 
|b><b|, one merely omits the summation over b and replaces o(A < po) 
by do(b < po).] 

Suppose that the energy spread AE, is not negligible, as will be the case if 
<b| V| EEF Q,An*> varies considerably when E, = E — Et varies within 
the interval AE, (this might happen in the neighborhood of a resonance 
unless AE, is much smaller than the width of the resonance, as will be dis- 
cussed in Chapter XVIII). Then the substitution (5.36) is not possible, and 
consequently one cannot calculate o(A < p,) as in (5.37). The quantity one 
can calculate then from (5.33) is?? 


| p(E,) dE, F(Ey — Ego)00(Es) = (pv0) + F, (5.39a) 


where v = v(Eg) = p(E,)/m = ./2mE,/m. 
This, according to (5.33), is then equal to 


(pva) « F = (2n)4 { p(E,) dE F(Es ~ Eno) SS <b|V |pabin* > 


Etnn’ 
x <paEtn’* |V|b><n||WP(EF) in o(Eg + ET — E,). (5.39b) 


As p(E,) and v(E,) are slowly varying functions of Eg, compared to the 
rapidly varying functions F(Eg — Ego) and |<b|V|EE%Q,An* >|*, one can 
take them out of the integrals on the left-hand side of (5.39a) and on the right- 
hand side of (5.39b) and replace them by their average values, p(Eg.) and 


13 If the spread in velocity is not negligible, then the incident "cries per unit area is 
fdz <x|WP(D|x> = Jat <x|¥° WiX(t)|x> instead of f dt vo<x|Wi()|x>, where ¥ ‘is the velocity 
operator. This results in a replacement of vg by v(E,) in (5.33). 
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v(Ego). Then one obtains the cross section formula for coarse-resolution 
experiments: 


a F = [dlp FEp ~ Epo)o(Es) = 2n)*h? —— | dE FEs ~ Eso) 


= 0) 
x ¥ Y (Ey + Ef — E,)<b|V|paEin*> 

b A Ednn’ 
x <paEhny’t|V|b><n|| W2(EDIIn’>. (5.40) 


We continue now with the expression (5.38) for the idealized energy resolu- 
tion, but will return to the case of a limited energy resolution in Section 
XVIIL8. 

The cross section o(A < pj) is often expressed in terms of the T-matrix. 
If the interaction Hamiltonian V is given, the T-matrix may be defined by 


(E4|T| Ed’) = (EG|V|\Ed’*). (5.41) 


This does not fully define a transition operator T, because not all matrix 
elements <Ed|T|E‘a’> of T are defined by (5.41), but only those matrix 
elements “on the energy shell” E = E’. The matrix elements <Ea| T|E’a’> 
that are “off the energy shell,” i.e., those for which E # E’, may be defined 
in various ways; and one gets differing transition operators depending on 
how these off-energy-shell matrix elements are taken. One such operator 
is T*, defined by 

<EG\T* | E'@’) = (E4|V|Ea'*). (5.42) 


Obviously the restriction of (5.34) to the energy shell gives the quantities 
<E4|T | Ea’> of (5.41). Another transition operator T~ is defined by 


<EG|T~ |E'a’) = (E4|V\E'a’~). (5.43) 


Although not obviously, this too agrees with (5.41) when restricted to the 
energy shell. Any useful definition of a transition operator T‘ must agree with 
(5.41) when on the energy shell: 


(E4|T‘|Ed> = (Ea|T | Ed’ (5.44) 


Because of the presence of the factor 0(Eg) + E; — E,) (which expresses 
energy conservation) in (5.38), it is only the on-energy-shell T-matrix 
elements that will contribute, a situation one always encounters when 
dealing with physically observable quantities. Since all transition operators 
agree on the energy shell, it makes no difference which T< is used. 

In the S-matrix approach mentioned in the previous section, it is not the 
existence of an interaction Hamiltonian V that is assumed, but rather that 
of the T-matrix (Ed| T| Ea’), which is considered the fundamental quantity. 
The expression of the cross section formula in terms of the T-matrix, 


2n)*h? 
eee yy Y. S(Ego + E+ — E,) 


A EGnn’ 


x €b|T|poAET > <PoAETN'|T|b><n||Wi(EZ)|In'> (5.38) 


o(A < Po) = 
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may therefore be considered valid in either approach.'* If a V exists, then 
<Eb|T | Ed’) is defined in terms of (5.41) and 


<Eb|T|Ed’> = y <Eb| Ed) (Ea|T | Ed’); (5.45) 


otherwise it is the T matrix <Ed|T|Ed’> that is fundamental and which 
cannot be further determined. 

In order to calculate the cross section from (5.38) or from (5.38’), one has 
to know both V and |ppAE;n* > or the matrix (Eb| T |p) AE“A), respectively. 
Even if V is considered to be known one still requires knowledge of the 
[PoAETN* > = | Eo E}QoAn*> =| EoG* > (Eo = p2/2m + E4), which are solu- 
tions of the Lippman—Schwinger integral equation. In principle the | Ey 4* > 
may be obtained by an interation process similar to the one described in 
Section VIII.1. The approximation that one obtains for the lowest order of 
the iteration process, i.e., the approximation 


<b|T* |Eod> = <b|V|Eod*> © <b|V| Ep 4), (5.46) 


is called the Born approximation, and is usually sufficient in the case of a high 
initial beam energy Ex, and a weak interaction V. 

Equation (5.38) [or (5.38’)] gives the cross section for the rather general 
situation in which the target state is left unspecified except for the require- 
ment that it be a mixture of discrete energy eigenstates. We shall now give 
the cross section for more specific target states. We first assume that W, is 
diagonal in some of the quantum numbers 7 of 4 = (y,7) and that no 
measurement has been made with respect to the other quantum numbers 4. 
For example, if the target consists of hydrogen or alkali atoms, the internal 
quantum numbers yn are yn = (n, j, j3,7), and E; = E;(njz) is a function of 
n, j, and x. Usually the atom is in a specified mixture of energy and angular- 
momentum and parity eigenstates, while the z-component of angular 
momentum has not been measured. Then 


i . . 
(njj3T|| WE, Inj’ 3 ee = yeaa Onn'j 3 0 jj WrlEx(nj, T) )O nn’ 
Similarly, in the general situation 
ccs in Shea 1 in 
Cain Wr(ED|A'N'> = : Sim Om WT(ET, fi), (5.47) 


14 Strictly speaking, the matrix elements of an operator are the numbers that result from 
placing the operator between two vectors from the same basis. Since the 6 and a may refer to 
eigenvalues of different operators, <Eb|T| Ed) need not be a matrix element,’ but it is a linear 
combination cf matrix elements. For convenience, however, we shall hereafter refer to (Eb| T | Ed) 
as a T-matrix element. 
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where g is the number of values # can take on. The cross section obtained in 
the case (5.47) is 


. (2n)*h?m WS = Age 
o(A < po, 7) = ——— © YY VL IKOIT |poAET AN | 
Po b A ES Wf 
x WHET, N)O(Ego + Et — Ey), (5.48) 


where we use the notation )., = (1/g) ¥., and )),, = (1/g) >, (averaging). 

The weight W(E4/) of each energy level may depend upon all the quantum 
numbers #E4 = (y,92---n,, EY) or only upon a subset of them, e.g., 1, Et. 
Very often it depends only upon E4: W(E%7) = W(E4). For example, if 
the target system is in thermal equilibrium, then W, is given by (II.4.50) 
and W(E7) is given by a Gibbs distribution 


e7 EnlkT e7 EnikT 
VE) =< 
Ye dime ee aE 
nEn En 


where dim #,, is the dimension of the energy eigenspace with eigenvalue E,,. 

A very special but rather common situation is the case in which the target 
is in a pure energy eigenstate, e.g., the ground state of an atom specified by 
the quantum numbers (E%, 70): 


<n|| Wi(E4 T\ln >= = Oona non’ Onde, (5.49) 
Or 
Wi = |Etno><Etnol- (5.49') 


The cross section (5.38) for scattering of an unpolarized beam off such 
targets then becomes 


oa 


m — 
o(A Po EF No) = oy IXb| VI PoAET ng > (Eo = Ee) 
boa 


ie m 
Po 


where Ey = Ego + E%.. In the Born approximation and for structureless 
projectiles, this reduces to the well-known formula 


(27)*h?m 


SY [<b] T |poAES 19> 25(Eo — Ey), (5.50) 
[Ie eh 


o(A — poEtn) = D |<b|V |PoETNo> (Ego + ET — E,). (5.51) 

Usually, the transition matrix is independent of the polarization A. Then 
the averaging over the polarizations in (5.50) is trivial, and the quantum 
numbers A could just as well be omitted. For observables that do not depend 
upon the polarization, an unpolarized beam can be treated like a beam of 
structureless projectiles; e.g., for an unpolarized electron beam one can 
neglect the spin of the electron. 

The most common experiments are those in which the differential cross 
section for scattering into a particular solid angle AQp, is measured by 
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placing a counter at a particular angle Q,,) to the incident direction. AY is 
then the subspace of states with momentum pointing in any of the directions 
Qo + AQyo. If the detector detects only those states within a certain energy 
range, then A# is further restricted. 

So far we have not specified the basis system |b) = | Eb) in the space of 
final states. In order to obtain the differential cross section for scattering into 
a particular direction specified by the angles Q = (@, ¢), one conveniently 
chooses a basis system of generalized eigenvectors labeled by Q or by the 
momentum k of the detected (scattered) particle. Thus we choose 


|b> = |Eb> = |kE> @ lel) = |EpQy)@ le 
= |EpeQn co. (5.52) 


where E = E, + € andk =kQ. € are the internal quantum numbers of the 
detected particle (polarizations) and ¢, ¢ are the internal energy and other 
internal quantum numbers of the target. Thus our description is general 
enough to include the case where the detected particle may be of a different 
kind than that of the incoming beam, and the particle left behind (post- 
collision target) may be different from those the target originally consisted 
of. For example, a photon y might be incident on an atom A and ionize it, 
leaving behind an ion A* with an electron coming off: y+ A—>e+ A”. 
A simpler example for the reader to keep in mind is the elastic or inelastic 
scattering of an electron or photon beam by an atom: e + A> e’ + A*. 
If the particles detected are nonrelativistic, then 


Ep = k*/2mp + E56), (5.53a) 
while if the particles detected are photons, 
Ep = ke. (5.53b) 


We shall impose the same normalizations as in (5.7) and (5.8) on the 
|ké> and |ef): 
CkE| KE = (kK — k’)dge (5.54) 


and 
Kel lel) = Ope Ore (355) 


Then in the case (5.53a) we have by derivations similar to those of (5.15) 
and (5.16) that 


i= y [owe fe, | mo/2molE — EM — e)dEdQy 
b b Ce¢ 


= ) pz(E) dE dQp (5.56) 
cel 


and 
CEQ 6 C| Ee’ Q oC) 
= mp'(2m,(E — EZ) — 2) 1 75(E — E')6. 6°(Qy — Xp) bee dx 
(557) 
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Equation (5.50) is then written in detail as 
a(A — PoETNo) = Gmnithetip y >, [aban,ne: Cra) 
Po eee 
x [KEN EC |T |PoAETNo> |?O(Eo — E’), (5.58) 
where 
E° = p2/2m, + Ef} (initial energy) 
= ey, 2m,(E' — EX(é') — €) (momentum of the detected particle) 


The ranges of the summation over ¢’, «’, ¢’ and the integration over E’, Op 
depend on the nature of the detection apparatus specified by A. From (5.58) 
we see that the differential cross section per unit solid angle for scattering 
in the direction Qp is 


2n h te pe A f re ge 
(, (20 + PoE? No) = Coates y [aewee, 2) 


Po Gh tele 
x [CEC QD EC |T |PoAETNo> |?O(Eo — E'). (5.59) 


We shall now assume that the projectile and the detected particle are the 
same (mg = mp = m) and that it is structureless or that the polarization 
quantum numbers 4 and € can be ignored.!* 4, and ¢ are then eigenvalues 
of the same set of operators 7°. The internal energies E{ or ¢’, which are 
eigenvalues of the same operator K ;, are assumed to be already determined 
by the internal quantum numbers, i.e., K; = K;(n°") as is the usual con- 
vention for targets with discrete energy spectrum.'® Then the expression 
for (5.59) simplifies to 

4,2 
= (Q — pono) = ee) [ae RE, (JOE, = E) 


x [CB'Q, OT |Potto> |’, (5.60) 


where 


= ./2m({E’ — Ex(0’) and Ey = p3/2m + Ex(no). 

If not only the initial total energy is fixed (by fixing 7) and the beam 
momentum p,) but also the final total energy (by choosing a detector that 
registers only a particular momentum k), and if also the final internal quan- 
tum numbers are fixed (by triggering the detector only if the final internal 
quantum numbers have the value ¢), then the differential cross section is 
obtained from aie as 


(2n)*h h2 mk _ 
(OEE = Polo) = re |<EQE| T|Pono>|?, (5.61) 


'S One could also imagine that the polarization A is incorporated in the quantum numbers . 


"© If the target is the Hydrogen atom, then y = (n, j, j;, =) and Ey = Ey = E,(n) given by 
VEST 2); 
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where 


k = \/2m(E — E70), 
which must be equal to 


2m( Se + E7(o) — E 3) 
ora To z 


because of the energy-conservation 6-function 6(E, — E). For the case of 
elastic scattering, ie., when the internal quantum numbers are not changed 
(N> = ¢), one obtains the well-known expression 


da 
re) (QEo No — PoNo) = (22)*h?m? |< Eo Qo! T|Pono> [F (5.62) 


The generalized eigenvectors in (5.62) [and also in the preceding expres- 
sions (5.61) and (5.60)] are normalized by (5.54) and (5.7) according to 


(EQn| E'Q'n'> = Cky| kn!) = by 5°(k — k’). 


If one uses generalized eigenvectors with a different normalization, then 
this expression changes correspondingly. 

For spherically symmetric interaction, the T-matrix and (therefore the 
differential cross section) will not depend upon the angle ¢ around the in- 
cident direction po/po, but only upon the angle 6 between the incident 
direction and the final direction Q = k/k (cf. Figure 3.1). (A derivation of 
this statement will be given in Chapter XVI.) 0 is called the scattering angle. 
In this case one often uses instead of the T-matrix element the elastic- 
scattering amplitude T(po, 6) defined by 


T(po, 9) = —4n?m(E,Q, NolT |Poto>- (5.63) 


The differential cross section (5.62) is then written as 


d 
fe (Eo, 9) = |T(po, 9) |. (5.64) 


Problems 


1. The quantity q = k — po, where p is the projectile momentum before scatter- 
ing and k is the projectile momentum after scattering, is called momentum transfer. 
(It represents the momentum that is transferred to the projectile by the target.) 

(a) Show thatthe 7matrix ¢k, 4)|T|po. 49> in the Born approximation ts a function 
of the momentum transfer only and does not depend upon the momenta 
Pp, and k separately if the interaction Hamiltonian V is a function of the projectile 
position. 

(b) Express the magnitude of the momentum transfer in terms of scattering momen- 
tum and scattering angle (for nonrelativistic particles), and show that in the 
Born approximation the forward T-matrix element ¢Po,%o|7T|Po,%o> iS 
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independent of scattering energy if the above interaction Hamiltonian is 
spherically symmetric: [V, L] = 0. 
(c) Show that at high energies (when the Born approximation is usually good), the 
scattering falls off as the scattering angle 6 increases from 0 to 7. 
2. Obtain the expression for the differential cross section (5.62) in terms of the 
T-matrix 


(EQ|T| EQ’) 
defined with the generalized eigenvectors | EQ), which are normalized according to 
(EQ|E'Q) = OE — E’)6(Q — 2, 
where the solid angle 6-function 6(Q — Q’) is defined by 
J dQ fQ)IOQ — Qo) = fo) 


with dQ = sin 6 dé dd. 
3. The Yukawa interaction was introduced to describe nuclear interactions and 
led to the prediction of the meson. It is described by the potential 


VAG) Saye ip 


and can also serve as a simple model for the screened Coulomb field of an atom. 
(a) Calculate the scattering amplitude T and the differential cross section in Born 
approximation. 
(b) The second-order term in the Born series of the scattering amplitude is given by 
[cf. (XV.3.38)] 


T?(k — po) = —4n?m¢kno|T|Pono> 


Ck|V |p ><p'|V |po> 
E,, — Ep + i0 


= —4n’m | d*p' 


Calculate the forward-scattering amplitude, i.e., the scattering amplitude for 
Po =k, up to the second order of the Born series, and find the values of the 
parameters m, 1, dnd g, and the energies for which the necessary condition for the 
validity of the Born approximation, | T?|< |T|, is fulfilled. 


CHAPTER XV 


Formal Scattering Theory and 
Other Theoretical Considerations 


Notions used in Chapter XIV for the derivation of the cross-section formula 
will be further discussed in this chapter to provide a deeper understanding 
of the material. In Section XV.1 the Lippman-—Schwinger equation is discus- 
sed again. Sections XV.2 and 3 are on formal scattering theory. In the 
Appendix a derivation of (XIV.2.9b), (XIV.2.17), and (XIV.5.21) is given, 
using the material developed in this chapter. 


XV.1 The Lippman-Schwinger Equation 
The Lippman-Schwinger equation 


Via*> (L.1+) 


1 
eat 
te? ee RO 
was introduced in Section VIJI.2. To each generalized eigenvector |a> of K 
with eigenvalue E, the Lippman-Schwinger equation relates a generalized 
eigenvector |a*>) or |a > of H = K + V with the same eigenvalue E,. 
(In Section VIII.2 it was assumed that the continuous parts of the spectra 
of H and of K were identical and that E, was an element of these continuous 
spectra.) The generalized eigenvectors |a> of K and |a*) of H are labeled 


by the energy E, together with some additional labels ad = (a,, a2,..., a,): 
|a> = |E,4), (1.2a) 
|a*> = |E,a*). (1.2b) 
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In Section VIII.1 it was assumed that the d were additional quantum numbers 
for the K basis, i.e., that 


{K Ay As. Ay (13) 
was a complete system of commuting observables (c.s.c.o). Suppose that 
1H A, Aces, Agt (1.4) 


is also ac.s.c.o. Both the generalized eigenvectors |a> of (1.3) and the proper 
and generalized eigenvectors |a,) and |a™ > of (1.4) are a basic of the space of 
physical states. . 

The assumption that (1.4) is a c.s.c.o. was not used in the derivation of the 
Lippman-Schwinger equation. Suppose that 


CRBs Doyen BES (6 R)) 
is ac.s.c.o. but that {H, B,, B,,..., B,} is not, ie., that 
[B;, H] = [B;, Vj] #0 (1.6) 
for some or all of the i’s (i = 1, 2,..., k). By repeating the arguments of 
Section VIII.2, the basis vectors 
[b> = |E,5> (1.7) 


of K may be related by way of the Lippman-Schwinger equation to the 
generalized eigenvectors 


|b*> = |E,(6)*> = |E,6> + E V|E,(b)*> (1.8) 


»-K+i0 
of H. However, the vectors \E,(b)> are generally not eigenvectors of the 
operators B;, because of (1.6). In this case the b;’s are defined not by a c.s.c.o. 
containing H, but only through the Lippman-Schwinger equation and the 
¢.s.c.0. containing K. [The parentheses around the label 6 are a temporary 
notation used to indicate that b has a different status—that of a label only— 
than that of the label E,, which indicates that |E,(b)* > is an eigenvector of 
H corresponding to the eigenvalue E,. Later we shall not make this dis- 
tinction. ] 

As an example of these possibilities consider the scattering of structureless 
particles off a fixed target. Let Q and P be the (canonically conjugate) 
position and momentum operators for the projectile, and assume the inter- 
action of the projectile with the fixed target is given by a spherically sym- 
metric potential V = V(Q) [Q = (Q’)!'’7]; the Hamiltonian for the system 
is then H = K + V, where K = P*/2m is the kinetic-energy operator for 
the projectile. Since [L;, V] = [L;, K] = 0, it follows that 


OO bee bee (1.9a) 
and 
1H Lea (1.9b) 


are both c.s.c.o.’s and that we have the case (1.4). The | Ell,> and the | Ell? > 
are corresponding generalized eigenvectors. {|Ell,>} is a basis for the entire 
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space of physical states, but {| Ell} >} is a basis for the “ subspace of scattering 
states” only. The projectiles in scattering experiments are usually prepared 
as collimated beams going in a specific direction, and it is therefore more 
convenient to use for the K-basis the generalized eigenvectors 
|b> = |p> = |Ep,p2> = | E6¢) (1.10) 

of the momentum operators P;. The c.s.c.o. of which the vectors (1.10) are 
eigenvectors is 

{P,,P2,P3} or {K,P,,P,} or {K, directionsof P}; (1.11) 
we then have the case (1.5) with (1.6), because 

ey = LF ;, V1 7 0: 

The label p for the H eigenvectors 


|(b)*> = |(p)*> (1.12) 
does not then mean that the |(p)* are eigenvectors of the P; but only means 
that the |(p)*> are connected to the momentum eigenvectors |p) by the 
Lippman-Schwinger equation, 


\(p)*> = |p> + Vi@p)*> (=p’/2m). (1.13) 


1 
E—K+i0 
The assumption that (1.3) is a c.s.c.o. for a particular physical system 
can only be justified by physical motivation. It is equivalent to the assumption 
that the set 


{| Eda) :(E, a) € spectrum (K, A;,..., A,)} (1.14) 


of generalized eigenvectors of (1.3) forms a basis for the space of physical 
states # of the physical system. Although the set 


{| Ed), :(E, 4) € spectrum (H, A,,..., A,)} (EPS) 
of generalized eigenvectors of (1.4) is also a basis for #, the sets 

{| Ea*):(E, @) € spectrum (K, A;,..., A,)} (1.15+) 
and 

{| Ea” >: (E, a) € spectrum (K, A;,..., A,)} (1.16—) 


of generalized eigenvectors of (1.4), which are obtained through the 
Lippman-Schwinger equations (1.1 +) and (1.1 —), are not generally bases 
for #, nor are the sets 


{| E(@)*) : (E, 4) € spectrum (K, A,,..., A,)} (1.17+) 


in the case that (1.4) is not a c.s.c.o. Usually the operator K has only a 
continuous spectrum, but H has in addition to this (same) continuous 
spectrum a discrete spectrum {E,} as well. The proper eigenvectors |E,a> with 
energy eigenvalues E,, in the discrete spectrum of H describe pure physical 
states corresponding to the bound states of the projectile-target system, and 
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span a subspace of bound states A y,g. The generalized eigenvector |Ed* ) 
or |Ed~) of H with energy eigenvalues E in the continuous spectrum span a 
subspace of scattering states #..4,, which is the orthogonal complement’ of 
oO vad 


KH —_ KH ona DB A scar: (1.18) 


The subspaces W , and # _ that are spanned by the sets (1.15+) and (1.15—) 
need not necessarily agree with W,,4, or with each other. Under very mild 
conditions on the inveraction V, which are generally fulfilled by physical 
scattering systems, the subspaces # ,, #_, and H..a, agree: 


KH. = KH _ = FE scat (1:19) 


The system for which (1.19) is fulfilled is said to be asymptotically complete. 

Under very general assumptions we thus have the following situation: 
The space of physical states W is spanned by the set of {| Ea>} of generalized 
eigenvectors of K, and is the direct sum # = # ona © FH oom Of the space 
H sna Spanned by the set {|E,4>} of proper eigenvectors of K and the space 
H a, spanned by either of the sets {|Ea@* >} or {| Ed” } of generalized eigen- 
vectors of H related to the | Ea> by the Lippman-Schwinger equation (1.1 + ). 

The Lippman-Schwinger equation is an equation for the |a* >. It can be 
iterated as described in Section VIII.1. It can also be solved formally. For 
any two invertable operators A and B the following easily established 
relation holds: 


1 ee 


1 
s— a (1.20) 


A special case of this is 


1 1 1 ee 
Pon20 Foe se er 


(1-213) 


By applying this to V|a*> and then using (1.1+) twice, we obtain the 
“solution” 


1 
as —- eer 
la" >= |a> + ae 9 V la. (1.224) 


The operator analogue of Equation (XIV.5.9) is 


I 1 ee 
Tot Eos (1.23) 


! This mathematical result is not difficult to accept if the discrete and continuous spectra of 
H are disjoint. It may happen, however, that some of the discrete eigenvalues coincide with 
values of the continuous spectrum (e.g., the doubly excited states of the He atom: see Sections 
XI.2 and XI.3). To such eigenvalues correspond both proper eigenvectors in #,,, and gener- 
alized eigenvectors in the set of generalized eigenvectors that span *#,.,,. Such proper and 
generalized eigenvectors are orthogonal, even though they belong to the same eigenvalue of H 
and the same eigenvalues d = (a, a2,..., ANON lig lng coogeebd: 
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which together with (1.22+) and (1.22—) allows us to relate the H eigen- 
vectors |a*> and |a™ > to each other: 


la*> — ja“) = —2nid(E, — H)V|a). (1.24) 


The formal solution (1.22 +) is not of much practical use, but it does serve 
as a way of introducing the S-matrix, which we shall meet in Section XV.3. 


XV.2 In-States and Out-States 


In this section we discuss the significance of the labels + and — that appear 
in the generalized eigenvectors |Ed*» and |Ed~ > of the exact Hamiltonian 
H = K + V. In these discussions we shall consider pure physical states as 
described by state vectors; the results are then easily carried over to statistical 
mixtures as described by statistical operators. 

Let us first describe the fictitious case where there is no interaction between 
beam and target. This means that the beam passes through the target without 
being affected at all. In the simplest case of potential scattering this means 
that the potential is turned off. The state vector ®(t) that describes this 
fictitious situation develops in time according to the free Hamiltonian K: 


O(t)=e*h (= (0). C1y) 


It is called the free state vector, and the state it describes is called the free state. 
To simplify the discussion we shall assume that ® is an eigenvector corre- 
sponding to the eigenvalues @ = (a;,a2,...,4,) of those operators A,, 
A,,..., A, that together with K form ac.s.c.o. (We assume the eigenvalues of 
A,, Az,..-, A, to be discrete.) Sometimes we will write O(a, t) instead of 
@(t) in order to emphasize ®(t)’s preparation as an eigenstate of the 47's. 
The probability distribution for obtaining the energy eigenvalue E when K 
is measured in the state ® is 


<EG|®) (®| Ed) = |<Ea|®)|?. (2.2+) 


Let us assume that the energy distribution is described by the function $(E), 
which is related to (Ea|®> by 


@(E) = 2np,{E)<Ea|®), (2.3+) 


where p,(E) is the normalization function for the generalized eigenvectors 
of K and the generalized eigenvectors of H as given by (XIV.2.9a) and 
(XIV.2.9b). That is, we assume that the state ®(t) has been prepared at some 
time such that at t = 0, and therefore at any other time ¢ it has the probability 
distribution: 


(EG|®(t)> <P(t)| Ea) = (Ea| > <®| Edy 
= (2np(E))*| ¢(E)|? (24+) 
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for the eigenvalue E of the energy operator K. ® may then be written in 
terms of the generalized eigenvectors | Ed) of K as 


® = O(0) = fo.te) dE <EG|®)>|Ea> = = [ae Q(E)|Ea>. (2.54+) 
The free state vector at any other time t is given by 
C= 2 [ae e '#"@(E)| Ea). (2.6+) 
Let us now consider the state vector 
®* = 0'(0) = = [ae Q(E)|Ea*> = ze) dE <EG|®)|Ea@*>, (2.7+) 


where | Ed*» are the generalized eigenvectors of H that are connected with 
the generalized eigenvectors | Ed) of K by (1.1+). The H energy distribution 
of ®* is the expectation value of the “operator” )\, | Ed’*)<Ed’* |: 


co i( Ika") <Ea*|)i@*) = LI<Ea’*|6*>)? = |CEa|®>|*. (28+) 


[Here we have used both the expansion (2.7+.) of ®* and the normalization 
(XIV.2.9b) of the | Ed*>’s.] Thus, the probability distribution for obtaining 
the value E when H is measured in the state ®* is the same as the probability 
distribution for obtaining the value E when the free energy K is measured in 
the (fictitious) state ® [or ®(t)]. For any other time ft, earlier or later, the 
state that develops to or from the state under the influence of the interaction 
is given by 


’ 1 ’ 
SO=e "Om ms [ae e”'*'| Ed* G(E). (2.9+) 


@*(t) is called the exact state vector. D*(t) and M(t) are states with the same 
quantum numbers 4 and the same energy distribution $(E); their difference 
is that ®*(t) develops according to the exact Hamiltonian and describes 
the development of an actual state, while ®(t) develops according to the 
free Hamiltonian and describes the development of the same state as if 
there were no interaction. 

The connection between ®*(t) and (rt) is obtained if one inserts the 
Lippman-Schwinger equation (1.1 +) into the integral of (2.9+): 


o*()= - [ae e h(E) |Ea> + 5 [ae e” F'O(E) V|Ea*» 


E—K+10 


1 
= M(t) + ay [ae e 'FtO(E) V|Ea*» (2.10+) 


J SU 


The first term in (2.10+) is given by (2.6+); the calculation of the second 
term is a purely mathematical task and is given below, after (2.20—). Using 
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the result of that calculation one obtains 


O*(t) = O(t) + i dt'Go(t — t)Vd*(t’) (2.11+) 
where 
—ie'™ ift>0 
CG) 18 Pee , 
o (t) iO( + te™' ‘5 on (2.12+) 
Q(t) is the unit step function 
ite 0, 
A(t) = ‘ 
©) ‘0 ift <0. ee 


Suppose we now take the limit t > — 00 in (2.11+). As the limit is taken, 
we eventually get t < t’ for any value of t’, so that the integrand vanishes. 
Thus 


®*(t) > ®t) ast —oo. (2.14+) 


This means that if in the remote past when the interaction was not effective 
a state had been prepared so as to have the energy distribution ¢(E) (and 
the quantum numbers 4), then this state would develop so that at time t it 
is given by ®*(t) of (2.9+). Thus ®*(t) describes a state that develops from a 
state ®(t) prepared in the remote past when the interaction V was not 
effective. The prepared state is called an in-state: 


P(t) = OM(1) [= OG, 1)] (2.15+) 


Describing the behavior of ®*(t) in the distant future when the interaction 
V ceases to be effective is another free state, the out-state: 


Ot) [ =O", ty] = oe iKtgout ga = o™'(0)]. (2.16+) 


In the present situation, where the exact state ®*(t) is prescribed by its 
behavior in the distant past as ®'(t), the outstate is unknown and un- 
controlled, except through our knowledge of V and our control over '"(1). 
As will be discussed in the next section, the connection between the in-state 
'"(t) and the out-state O°''(t) = S@'"(t) is the scattering operator S. 

We shall now repeat the same arguments used above but using the general- 
ized eigenvectors |Eb~> of H that appear in the Lippman -Schwinger 
equation 

1 
|Eb~> = | Eby + a0 V|Eb-> Ou) 
instead of the |E4*) appearing in (1.1+). Here 6 = (b,, b,..., b,) are the 
eigenvalues of a set of operators B,, B,,..., B,, which are possibly different 
from the operators A,, A2,..., A,, but which together with K form a c.s.c.o. 
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(The spectra of B,, B,,..., B, are assumed for simplicity to be discrete.) 
We thus consider the free state 


Pi) [= YO, 1] = e KP = = [ae e~ F4y(E)| Eby, (2.6—) 


where the function w/(E) describes the energy distribution of ‘P(t) as it will be 
prepared at some time. W(E) is connected with the probability distribution 
for the energy operator K by 


(Eb| P(1)> (P(t) Eb> = (20 ;(E))~7 |W (E)I?. (2.4—) 


The function 6;(E) is a weight function for the normalization of the |Eb)’s, 
just as p,(E) was a weight function for the normalization (XIV.2.9a) of the 
|Ed>’s. From its expansion (2.6—), W(t) is obviously an eigenvector of 
B,, B,..., B, corresponding to the eigenvalues b. 

The exact state vector ‘¥~ (t) corresponding to ‘P(t) is defined to be 


1 e 
2 Oe on [ae e~ #t)(E)|Eb- >, (2.9—) 


where | Eb~> are the generalized eigenvectors of H that are connected with 
the generalized eigenvectors |EbY of K by (2.17—). Again P(t) and P(t) 
are states with the same quantum numbers b and the same energy distribu- 
tion W(E), but ‘¥~(t) develops according to the exact Hamiltonian H, while 
Y(t) develops according to the free Hamiltonian K. To obtain the connection 
between ‘Y (t) and Y(t) we insert (2.17 —) into (2.9—) and obtain 


| 1 1 
Y(t) = oF [ae eE4W(E)| Eby =F ae [ae e 'FAW(E) pk a0 V|Eb-» 


i —iEt 1 7 = 
= ¥) + = [are 8) ey ee Qne=) 


The result of a calculation similar to the calculation to be given for the second 
term in (2.10+) gives 


(tt) = ¥G) i dt'Go(t — t') Vw (t’), (2.11—) 
where 
er +ie “ f7< 0 
Go (t) = +i0(—De"™ = _ 
Os Ste 1) ft >0. Ce 
If we now take the limit t > +00 in (2.11—), we obtain 
Y(t) > P(t) ast +00 (2.14—) 


This means that if in the distant future a state is measured that has the energy 
distribution y(E) and quantum numbers 6, then this state was given at time t 
by Y(t) of (2.9—). Thus Y(t) describes a state that will develop into a 
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known state in the distant future when the interaction V is no longer effective. 
This state will be called an out-state: 


YQ) = PY") [= P"'(5, 2)] (Q15=) 


Describing the behavior of ‘¥~(t) in the remote past before the interaction 
was effective is another free state, the in-state: 


Pir) (= P76, Dd] = ev pp = Wi), = (2.16 -) 


Now the exact state ‘¥~(t) is prescribed by what its behavior will be in the 
distant future as pour, t), and it is the in-state that is unknown and un- 
controlled, except through our knowledge of V and our control over ¥°"(t). 
The connection between the in- and out-states is again P(r) = SW¥i(r)— 
or, since it is our control over ‘¥°"(t) that should be emphasized, ¥'(t) = 
ge TSE) = SP). 

We have thus found the meaning of the labels + and — : ®*(4, t) describes 
a state that in the remote past, before the interaction V became effective, 
was prepared as 0'"(4, t) with well-defined quantum numbers 4 and a certain 
energy distribution $(E£). In the distant future it will again become a free 
state, an out-state O°"'(t); however, this state is not simple, since it is deter- 
mined not only by the preparation but also by the scattering process. 
\Y~ (b, t) describes a state that in the distant future, after the interaction V has 
ceased, will be given by the free state ‘¥?"'(r) with a simple energy distribution 
W(E) and well-determined values ) for the other quantum numbers. In the 
remote past (bh, t) was also a free state ¥'"(t), but its properties must have 
been more complicated. Since in scattering experiments it is the behavior of 
the system in the distant past over which we exercise control, it is the + states 
@®* (4, t) that are natural to use when describing scattering experiments. 

Our results may be summarized as integral equations for the exact states 
®*(d,t) and Y~(b, t) in terms of the controlled free states ®'(d, t) and 
ae, t): 


@*(4, t) = ®'(4, t) + i dt’'Go(t —t)V@*(4,t'), (2.18 +) 


W-(b, t) = Pb, t) + | dt'Go(t —t)V¥-(6,t'). (2.18 —-) 


(There are also corresponding integral equations for ®*(4, t) in terms of 
@°"'(d, t) and for ‘Y~(b, t) in terms of ¥'"(b, t), but in view of the uncontrolled 
nature of ®'(4, t) and ‘¥'"(b, t), such equations have little meaning.) 

The preceding results are easily extended by linearity to the case of a 
mixture. The exact state that was described in the distant past by the statistical 
operator 

Wirt) = Y Wran| OG, t)> <O'"(G,, 0) (2.19+) 


mn 
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is 
W*( = Y) Wenl On Gas P< OD, (a, 0). (2.204) 


and the exact state that will be described in the distant future by 


w(t) a Dy Wee Pepe, t)> CED. t)| (2:19) 
is 
W(t) = oy We wate t)> vues t)|. (220-) 


We shall now take up the purely mathematical task of showing that 
Equations (2.11+) follow from (2.10+), using results from distribution 
theory, in particular results concerning the Fourier transforms of generalized 
functions. 


[The Fourier transform g = F[g] of a function g(t) is defined by 
HE) = Fad) = [de® a0. (2.21) 
The inverse Fourier transformation F —' is given by 


1 hia ; 
FE =5- | dee) (2.22) 


Strictly speaking, (2.21) is the definition of a well-behaved function. 
The Fourier transform of a generalized function f, defined by the 
linear functional f(¥) = (f,‘¥), is defined by (F[f], F[W]) = 
2n(f, ‘¥). But as the rules for Fourier transforms are retained for 
generalized functions, when interpreted properly one may ignore 
this mathematical precision and use (2.21) and (2.22) also for 
generalized functions.” In particular, one always has 


g = F'[g] = F"'[F{g]] = FLF" “fol. (2.23) 


Translation in the space of Fourier-transformed functions is 
obviously given by 


GE — E') = F,[e*“g(t)]. (2.24) 


If we apply (2.24) to the Fourier transforms 


F,(6[£t)] = +i 


E+10 (2.254) 


? Gel’fand and Shilov (1964, Vol. 2, Chapter III; Vol. 1, Chapter IT). 
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of the generalized function (+t) and 0(—1), we get 


1 
= Tio kt 
E-E +10 F,[ Fie '0(+1)]. (2.26+) 
The convolution f * g of two functions f and g is defined by 
(Feg= | at feo) (2.27) 
and has the particularly simple Fourier transform? 
FLf*g] = F[f]- Fig], (2.28) 
or equivalently, 
F-'Lf-g]=frg. (2.29) 


With the above mathematical facts we shall show the equivalence 
of the second term in (2.10+) and (2.11+,). If we take the scalar 
product of ®*(t) as given by (2.10+) with an arbitrary chosen K 
basis vector |a’> = | E’a’) [more precisely, if we consider the value 
of the functional |®*(t)) ¢®* * = ® at the point <a'|—€®*], we 
obtain 


1 | 1 
’ + ae , —iEt 
(a |®*() = <a’ OO) + 5 | dE e“™46(E) 
x <a'|V| Eat). (2.30+) 
Equation (2.9 + ) implies 


a'|V@*()> = dee“ 4(E)<a'iV Ea") 


= F, *[@(E)<a'|V | Ea* >], (231+) 
or equivalently, 
p(E)<a'|V|Ea*> = F,[<a'|V|®*(0)>]. (2.32+) 


The second term on the right-hand side of (2.30+ ) is now recognized 
as the inverse Fourier transform of the product of two Fourier 
transforms [cf. (2.26+) and (2.32+)]. By using (2.29) we may 
rewrite (2.30+) as 


<a'|®*(t)> = Ca‘ | ®(t)> + (—i0(+ De") * (<a‘|V IO" (0) 
= <a'|®(t)>) — if dt' A+(t — tye"Fe™ 
x <a'|V|®*(1)). (2.33 +) 


3 This statement has to be qualified somewhat in the case of generalized functions. See 
Gel’fand and Shilov (1964, Vol. 2, Chapter III). 


346 Formal Scattering Theory and Other Theoretical Considerations 


Since [a’) was an arbitrary K basis vector, we have shown 
|D*(t)> = |®@)> + | dt Gott —¢)V|O*(¢)>, (2.114) 


as we set out to do. The equivalence of the second terms in (2.10—) 
and (2.11—) may be shown by a similar calculation. 
The operators 


Gt(t) = Fib(+He7™ 


are seen by (2.26+) to be the inverse Fourier transforms of the re- 
solvents 


Gi(E) = (E — K + i0)~' (2.34) 


of the operator K. The operator Gg (t) is known as the retarded 
Green’s function, and Go(t) is known as the advanced Green’s 
function. (In physics the distinction between a function g(t) and its 
Fourier transform g(E) = F;[g(t)] is not always reflected by the 
terminology, and so the resolvents Gi (E) are sometimes also called 
“Green’s functions,” as mentioned in Section VIII.1. Physicists 
frequently blur the notation as well, and write Gg (E) for Ge(E); 
the argument of the function is then used to tell whether the function 
or its Fourier transform is meant.) | 


The operators Gi(t) are both free Green’s functions, as opposed to the 
exact Green’s functions given by 


. 0 ift <0 

ot eee = (Ht : ’ 
G*(t) = —i0(+tbe ae ft > 0, (2.35+) 
Se in ee tiee it <0, 5 ee 
G(t) = +i0(—de = ‘% eo (2.35—) 


These latter Green’s functions may be used to express formal solutions of 
(2.18 +) and (2.18—): 


+00 
O* (4, t) = O'(4, t) + | dt’ G*(t—r)VO%G4,t'), (2.36 +) 


(6, t) = Bob, t) + | ” dt’ G-(t — r)V PB, 2). (2.36—) 


The derivation of these solutions, which is left as an exercise, parallels the 
derivations of (2.11+) [or equivalently of (2.18+)] except that where the 
derivations of the latter use the Lippman-Schwinger equation (1.1+) and 
(2.17—), the derivations of the former use the formal solution (1.22+,) 
and the formal solution for the | Eb~ )’s corresponding to (1.22). 

One final remark should be made: None of our arguments have depended 
on the assumptions that ®'(r) is an eigenvector ©'"(d, t) of the operators 
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A,, Az,..., A, and that ‘¥°" is an eigenvector ‘P°""(b, t) of the operators 
B,, B,,..., B,. All equations for pure states (as opposed to statistical 
mixtures) are therefore equally valid, by linearity, if we remove the restrictions 
that ®'"(¢) and ‘¥°""(t) must be eigenvectors of the operators A,, A2,..., A, 
and of the operators B,, B,,..., B,, respectively. 


XV.3 The S-Operator and the Moller Wave Operators 


In a scattering experiment a state is prepared before the projectile and target 
start interacting with each other, and a state is detected after they stop 
interacting with each other. Thus in scattering experiments in-states are 
transformed into out-states. The operator that describes this transformation 
is called the scattering operator or S-operator. For every physical system 
undergoing collisions we postulate the existence of a unitary operator S 
that transforms in-states into out-states. The S-operator is so immediately 
related to the directly observable quantities (like cross sections) that know- 
ledge of it leads at once to the prediction of these quantities. 

As remarked in Section XIV.4, the concept of the S-operator makes sense 
even if a generator H of time development is not defined and the time- 
development axiom (V in Chapter XII) does not hold. If H is not defined, 
then neither are its generalized eigenvectors |a*> nor the exact states 
@®*(t) and ‘Y(t). However, the states ©" and ‘¥°"', which describe the pre- 
pared and detected states, and the operator 


5: — Ort = SO, (3.1) 


which transforms the prepared (and hence controlled) in-state ©" into the 
uncontrolled out-state ©", are still valid concepts. If the time development 
axiom does hold (and there has been no evidence to the contrary in non- 
relativistic quantum physics), then the S-operator and its matrix, the S- 
matrix 


CE So) G.2) 


may be expressed in terms of more fundamental and less directly observable 
quantities. . 

We shall now do this by giving a precise definition of S in terms of pre- 
viously introduced quantities. The modulus squared of the matrix element 


(P~ (6, t), ®*G, t)) (3.3) 


gives the probability of finding the state ‘Y~ (b, t), which is observed after the 
interaction V has ceased to be effective as the state wou t), if the state of 
the system is ®* (4, t), which was prepared before V became effective as the 
state ®'"(4, t). That is to say, (3.3) describes the probability for a transition 
from an initial configuration, described by the quantum numbers a and the 
energy distribution ¢(E), into a final configuration, described by the quantum 
numbers b and the energy distribution (E). We first develop an expression 
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for (3.3) in terms of the in-state © and the out-state ‘¥°"'. To do this we shall 
use the formal solutions (2.36 + ) and (2.36 —), 


+a 
Ot (1) = Ot) + i dt’ Gt(t — t)V@'Xt’') (3.44) 
and 
+00 
(i ee { dt’ G°(t — Vv P"'(t'), (3.4—) 
of (2.18+) and (2.18 —). Equations (2.6+) and (2.15+) imply 
Min(t’) = eT KE -D@Min(t) (3.5+) 
and 
a a = eR pours). (3.5 —) 


These expressions may be substituted into the integrands of (3.4+) and 
(3.4—) to get 
O*(t) = QTM (1) (3.6+) 


and 
(py, erp). (3.6—) 


where we have defined the Moller wave operators 


+ co 
OE = (a5 | dG *(t— 1 Vea” 
= el 


= ]F i| dt'G*(t — t')VG5(t' — t) 


+o 
=a | dt” Gt(—t")VGt(t’). Cg 


An immediate consequence of (3.6+) is that the mixtures W *(t) of (2.20+) 
and W ~(t) of (2.20—) are related to the mixtures W'(t) of (2.19+) and 
W(t) of (2.19—) by 


Wt@® =Q*wW™@Qtt (3.8+) 
and by 
W(t) =Q°W"(1)Q-1, (3.8—) 
Use of (3.6+) in (3.3) gives 
CH- (6, 1), OG, 1) = (6, 1), Q-10* OG, 1) (3.9) 


This last equation motivates 
§S=Q-tQt (3.10) 


as the definition of the scattering operator. 
The last expression for Q* in (3.7+) shows Q* to be time-independent, 
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which implies that S is also time-independent. So is the matrix element (3.9), 
For ®*(t) is given both by 


Ot (t) = e Him = e~ HIC+ Mind) 
and by 
*(t) _ Q* O'"(t) = Qt en ikiqin 
Since {@'"} spans the space of physical states, we conclude that 
Co ee ul) 
[The upper equations follow from the stated argument; a similar argument 
gives (3.11 —).] Equivalently, one has the so-called intertwining relations 
Oe = OK. (12a) 
It is then easily shown that 
bo] — 0, (3.13) 
a consequence of which is the time independence of (3.9): 
CE (6, 1), O*(a, 1) = (8°, 0), e* ™Se- Xing, 0)) 
= (1°95 0). SO (a, 0)): 
Equation (3.13), it should be noted, may be interpreted as a statement of 
energy conservation between the in-state ®'" and the out-state ‘¥"". 


If we substitute (2.6+) and (2.9+ ) into the left-hand side and right-hand 
side of (3.6+), respectively, we obtain 


(3.14) 


= [ae me OE) ba = =/ dE e~*¥'@(E)Q* | Ea). 


Since $(E) may be any well-behaved function, it follows that 

OQ" Ea — |Ea™ >. (3.15+) 
Equations (2.6 —), (2.9—), and (3.6—) may be used in a similar manner to 
conclude 

Caiea = |ra~ >: (3.15—) 


Since we have assumed the system is asymptotically complete [Equation 
(1.19)], we may conclude that the Moller operators Q* and Q” map the 
space of physical states # onto the space of scattering states #..4,: 


O70 + .. (3.16) 


In other words, the domain of definition of both Q* and Q” is the entire 
space ¥, but the range of Q* and Q” is the (proper if bound states exist) 
subspace 4 .car- 


[An operator A is said to be isometric if it preserves the norm of 
vectors, i.e., if 


(Wy, W) = (Ap, A) = (W, ATA) (3.17) 
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for all vectors w. An equivalent definition is that A satisfies the 


condition 
AeA (3.18) 
If A satisfies the stronger condition 
A A= I vand AA) —7 (3.19) 


then A is unitary. A unitary operator A is defined on all of # and 
has all of W as its range, Le., maps # onto # in a one-to-one 
fashion. A unitary operator is necessarily isometric, but not vice 
versa. | 


Because 0? H = H,,, is usually not all of #, we should not expect the 
Moller operators to be unitary; they are, however, isometric, a fact we shall 
now show. 

By inserting 


©*(t’) =e iH(t’ —9@* (t) 
[which follows from (2.9+ )] into (2.18 +), we obtain 


+ 00 
o'(1) = OT (t) — | dt' Gg(t — t')Ve~#"-%@* (t) 


= ( + if ° dt'Go(t —t)VG (tt - nor (3.20) 


- (1+ i| dt” Go 1 VG-(+0)) 0° 


The second line follows from the definition (2.35—) of G-(t) and the fact 
that Gg (t — t') is nonzero only for t — t’ > 0. From the definitions (2.12 +) 
and (2.35+) of the Green’s functions and from the Hermiticity of K and H 
it easily follows that 


Gs(-t)' = GE (3;2lac) 
and 
Gt(t)t = G*(—2). G2ZIb-+) 


Upon use of (3.21+) and the definition (3.7+) of Q*, Equation (3.20) 
becomes 


oo it 
P(t) = ( a, | de" G1 VG5(0) © (j= ORO). Goren 


But ®*(t) = Q* '(t), so 
O'(t) = Q*tQ*+Hi(2). 
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We have not made use of any property of ®'"(t) except that it can be expanded 
in terms of the generalized eigenvectors |Ed> of K, so ®"(t) is an arbitrary 
element of #. Consequently, 


Q*ttQt = J, (3.23 +) 
i.e., 2 is isometric. A similar derivation gives 
Q-fTQ7 =]. (3.23 —) 


The failure of Q* to be unitary is now easily shown: By use of (3.15+) 
and (XIV.2.11) one obtains 


0-1 = (r at jay <ai}(5 at ia><a'l) 
= Di la*><ala’)<a'* | = Y’ |a*> <a*| (3.24+) 


= ica =. Tyna 
where 


Pl cat =o |a* » <a* | (3.25a) 


and 
era = 3 | a,» {On| (3.25b) 


are the projectors onto the subspaces of scattering states and bound states, 
HK cn, aNd Hyg, respectively. Unless the unitary deficiency Nynq is zero, 
i.e., unless there are no bound states |«,>, the operators Q* can only be 
isometric and not unitary. 

The adjoint Maller operators Q** map #,.a, back onto # ; more precisely, 
they annihilate the #,,4 part of # = Hyna ® A xq and map the rest, 
H on, back onto #: 


; 
O** 0, = (r a+ ia <a'i la.» =D 1a'><a’*|a,> = 0, (3.264) 


or equivalently, 
O*'Tpna = 0; (3.26'+) 
and 
OF'\a*> =] 0210 |a> = I|a>='\a>. (3.27+) 
We are now in a position to show that S is unitary: 
SSt = (Q71Q*) Qt) = 0°10 — WgpngQ”™ = 7 (3.28a) 
and 


Sts = (Q*tN-)(Q-10*) = O70 — MyygQ* =I. — (3.28b) 
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In writing Equation (3.28) we have used the definition (3.10) of S, the iso- 
metricity (3.23+) of Q*, and Equations (3.24+) and (3.26+); in order for 
Equation (3.28) to make sense—indeed, for the definition (3.10) of S to make 
sense—we had to assume domain Q** = range Q*' = Q'#, or, in other 
words, asymptotic completeness in the form (3.16). In this way we have 
proven that the S-operator defined by (3.10) and (3.7 +) in terms of V and K 
is unitary. 
We define the S-matrix to be the matrix {S,,} of the scattering operator 
with respect to the K basis: 
Sea = (alS|a’) = <alQ~*Q* Ja’) = <a" |a’*). (3.10’) 
By using the relation (1.24) we may write S,,, either as 
Soa = (<a*| — 2ni<a| VO(E, — H)]\a'*> (3.29) 
= a emote) — Ey) aaa 


or as 
Sag = <a ala 9) — 220(b = A)Via) 
=a |a' > — 2nid(E, — E,)<a |Vla’>. 
If we use (XIV.2.9b) (which we shall finally prove at the end of this section), 


we see that the first terms on the right-hand sides of (3.29) and (3.30) are 
equal. Thus 


(3.30) 


a\Vla*> = <a |Via) when E, = E, = E, 


1.€., 
<EalV\ Ea => = (Ede |V | ka: G31) 
In Section XIV.5 we introduced the on-the-energy-shell T-matrix 
oo (Ey = <Ea|V | Ea > (3.32) 


(which does not define an operator) and its off-the-energy-shell extension 
Toy = <alT* |a> = <alV ila" = <E,a'|\V|E,a’ >, (3.33) 


which defines an operator T *. Equation (3.31) suggests a different extension 
of the T-matrix, namely 


Tay = <alT la = Ca |Vla’> = KE@ |VIE, a), (3.34) 


which defines the operator T-. {T;,.} and {T,j,-} are extension of {T,,(E)} 
in the sense that 


Ti = eg (ewe bn ek (3333) 
The S-matrix element S,,, may then be expressed as 
Sea = (ala) — 2ni(E, — Ey) Toa(E,) 
= P(E.) “Og (Eg — Eq) — 2n0(E, ~ E,)T,,(E) (G26) 
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Note that it is the on-the-energy-shell T-matrix elements that contribute to 
the S-matrix; matrix elements of S between generalized eigenvectors |E,@a> 
and |E,@’» vanish for E, # E,.. 

An integral equation for the T*-matrix may be obtained by inserting the 
Lippman-Schwinger equation (1.1+) into (3.33): 


| F V , } 


Un 
pao | 


<a|V|a"><a"|Vla'*> (3.37) 


= <a|Vla’> + 
Sad » Ea ae + 10 


<a|V|a">T¢ 
aly |a> +) —— 
aul’, LE DE. +10 


Equation (3.37) may be iterated by repeated substitution of the right-hand 
side into itself: 


<alVla"<a"|Vla> 


fos = V ' : 
ee ap (3.38) 

The Born approximation 
Ti © (al|V(a’> = <E,4|V|E,4@), (3.39) 


in which only the first term of the iterative solution (3.38) is kept, is usually 
adequate in the case of high energies E, = E, = E and a weak interaction V. 
From the unitarity of the operator S it follows that the S-matrix is also 


unitary: 
Ss Saa” Sea” ee ys StaSara Sai Osa (3.40) 


or in more detail, 


y { dE yr py Eqr)(Eq4|S|Eqr "> (Eq G'|S|Egrd”>* 


= | dB e-bo(Ea)Eo-d"S| Bud)" (Ey a"|S\Ed’> (3.40) 


- P(E)” Oa O(E, ia E,;). 


By inserting (3.40’) into (3.36) we may express the unitarity of {S,,} in 
terms of the matrix elements T,,(E): 


Tyg(E) — T$AE) = — 201), Pel E)Taw(E)T 3 e(E). (3.41) 


The unitarity condition as expressed by (3.41) is sometimes called the 


generalized optical theorem. | 
In summary, we have obtained both the S matrix and the T matrix by 
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assuming the axiom V of continuous time development. We have expressed 
the S-matrix and T-matrix in terms of quantities, like Vand |a™ >, that depend 
on the existence of the time-development generator H ([cf. (3.32) and (3.36)]. 
The generalized eigenvectors |Ea*> of H and the corresponding states 
|@*(d, t)> describe the situation at the time of collision; they are therefore 
quantities that are never accessible to experimental determination. In 
contrast, the generalized eigenvectors |Ed> of K and the corresponding 
states |'"(d, 1)) and |®™(4, t)> are experimentally accessible by means of 
the initial preparation and final observation of a collision experiment. They, 
together with the operator that describes the transition between them, are 
indispensable in the theoretical description of a collision experiment; 
whereas experimentally nonaccessible quantities like |\@*(d, t)> are theor- 
etical tools and rest on additional theoretical assumptions (e.g., the axiom 
V). 

The theoretical description which is based only on the experimentally 
accessible quantities is called S-matrix theory. The S-matrix and the on- 
energy-shell T-matrix, which are connected by way of (3.36), are then the 
fundamental physical quantities; furthermore, the only assumption entering 
into (3.36) is the conservation of energy between the in-state and the out-state 
({S, K] = 0). Unitarity, which implies the condition (3.41) on the T-matrix, 
is a separate fundamental assumption in S-matrix theory and is justified by 
the requirement of conservation of overall probability. Another fundamental 
assumption in S-matrix theory is the analyticity of the T-matrix as a function 
of energy and other physical parameters. This is argued to be connected 
with causality. (We shall discuss this connection in Section XVIII.4.) In 
the conventional formulation based on the assumption of a continuous time 
development, properties such as unitarity and analyticity (in a certain 
domain) are derived from the properties of H and of V. 

Symmetry considerations place the most significant conditions on the 
S-matrix. In the conventional formulation symmetries are formulated as 
transformation properties of V and H under various symmetry transforma- 
tions, and these transformation properties then lead to derived symmetry 
properties of the S-matrix. (We shall discuss the consequences of rotational 
symmetry in Chapter XVI.) In S-matrix theory the symmetries are formulated 
directly in terms of symmetries of the S-matrix or in terms of transformation 
properties of the T-operator. 

If V and H are known, then the conventional approach to the description 
of collision processes is more “fundamental,” although based on more 
assumptions. After Equation (3.37) has been solved for the T-matrix, one 
may predict experimental results. As mentioned in previous chapters, in 
physics we most often have the reverse situation: One wants to conjecture 
from experimental data (e.g., cross sections) the properties of the fundamental 
observables and then use these properties to make predictions about other 
experimental quantities. In this situation a formulation of the properties in 
terms of the T-operator is just as valid as a formulation in terms of V, and 
is often more practical. 
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XV.A_ Appendix 


With the help of the isometricity (3.23+) of the Moller operators, Q*tQ+ 
= I, it is now trivial to prove two equalities that were used in Chapter XIV. 
(The proof of isometricity did not depend upon these equalities.) Equation 
(XIV.2.9b) follows from (3.15 +) and isometricity: 


¢a* |a’*> = (<a|Q**)(Q* la’) 
= <a|(Q*10*)|a’) = Cala’. 
To prove (XIV.5.21) we use (3.15+) and (3.8+) together with isometricity: 


<a" |W*(Ola’™> = (Kal Q*1)(Q* Wit)0*1)Q* ay) 
= al(Q*1Q*) W(t) (CQ*tO*)ia> — (A.2+) 


(A.1) 


= <a|W™(t)|a’>; 
in a similar fashion one can prove 
<b-|W~(t)|b'~> = <b, Wo") |b’. (A.2—) 


Lastly we show that the term II given by (XIV.2.17) vanishes. The key to 
doing so is Equation (A.2+). If we substitute (XIV.2.9b) and (A.2+) at 
t = 0 into (XIV.2.17), then 

I = 9) Ye Me" Feb] V [a p(Ey) '6a55(Ea — E,)<a| W|a’> 


b aa’ 


= [p(E,) db, ¥, | p(E,) dB, p(E,) dE, Yet Fo 
5 aa’ 
x (E,b|V|E,4>p(Ey)” '6a5(Eg — E,)<E,4|W'"|E,-d’> 
= [r(E,) dk, ¥, [o(E,) dE, Ye -F"CE,6|V EA) 
6 a 


x (E,4|W'"|E,5> 
= VY ete 5| Va) <a] Wi"|b». (A.3) 
boa 
But the summation )\, is only over those values of b for which |b>€ A#, 


and by hypothesis the initial state W'" is so prepared that | A> ¢ A# for all 
states | A> that give a nonzero contribution (w, # 0) to 


Wi = NEE 
Therefore 
WwW" |b> =0 
for all b in the summation, so that 
II = 0. (A.4) 


The contribution of IJ both to the transition rate d¢A)>,/dt of (XTV.2.5) 
and to the transition probability (A),, of (XIV.5.2) is then, of course, zero. 


CHAPTER XVI 


Elastic and Inelastic Scattering 
for Spherically Symmetric 
Interactions 


In this chapter we derive general results for spherically symmetric situations, 
i.e., for situations when in addition to the free Hamiltonian K, the interaction 
Hamiltonian V and the transition operator T also commute with the operator 
of angular momentum. For the sake of simplicity we consider the spin-zero 
case. In Section XVI.1, the partial-wave expansion is discussed and the partial 
cross sections are derived. In Section XVI.2, the phase shift is introduced as 
a consequence of the unitarity of the S-matrix. In Section XVI.3, a graphical 
representation of the partial-wave amplitude, the Argand diagram, is 
introduced. 


XVI.1 Partial-Wave Expansion 


Many scattering problems are spherically symmetric. This means that both 
H=K+V and V commute with the (orbital) angular-momentum 
operators L,; or, if the scattering is described by a transition operator T, that 
T (and hence the scattering operator S) commutes with the L;’s: 


iV, Ei 0, Lee = 0: (1.1) 
Then 

GLE od ea pe (1.2a) 
and 

(ELS La), (1.2b) 
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where 7°? and x°? stand for the internal observables (their eigenvalues 7 and K 
being the internal quantum numbers), are both c.s.c.0.’s, and one may choose 
collections of generalized eigenvectors 


|E,! 130) (1.3a) 
and 
|E,L 137 > (1.3b) 


of these systems as bases. 

If LV, 4°?] = 0 and therefore [H, °°] = 0, then the collection of opera- 
tors x°? in (1.2b) can be chosen to be n°, and the generalized eigenvectors 
(1.3b) are eigenvectors of the c.s.c.o. (1.2b). In general [V, 7°?] 4 0, and 
then (1.3b) are not eigenvectors of the c.s.c.o. (1.2b) and the meaning of the 
label H is defined through the Lippman-Schwinger equation as explained 
in Section XV.1. 

Stil another c.s.c.o. consists of the projectile momentum P (or the momen- 
tum P in the center-of-mass system when the target is not fixed) and the 
operators for the internal quantum numbers °°. 


{eit >, Ps. 7°}. (1.4) 
The corresponding basis consists of the generalized momentum eigenvectors 
Ip1>. . (1.5) 


The additional quantum numbers yn = (y,42---n,) may refer to the 
internal structure of the target, of the projectile, or of both. For example, 
if the target is a hydrogen atom, then H will contain the quantum numbers 
n, j, /3, and 7 that characterize the state of the electron in the hydrogen atom 
(cf. Chapters VI and IX). According to (1.2), [L;, 7°?] = 0; thus we exclude 
the possibility that 7 includes the quantum numbers of an internal angular 
momentum. Then the total angular momentum is equal to the orbital 
angular momentum L,, and we avoid the complication of coupling the 
internal angular momentum with the orbital angular momentum to get the 
total angular momentum. Most processes encountered in experiment involve 
at least one internal angular momentum; however, to understand the princi- 
pal problems of this chapter it is easier if one omits at the beginning the 
inessential complications introduced by the coupling of angular momenta. 

The simplest case we can discuss is the case of a structureless projectile 
incident on a structureless target. Whether the target and/or projectile may be 
considered to be structureless depends on the energies involved. If, for 
example, the target is a helium atom, then it may be considered structureless 
for projectile energies less than the energy of the first excited state, and the 
quantum numbers y are unnecessary. If the projectile energy is comparable 
in size to the difference between the energy levels, then the target’s structure 
must be considered, and y will contain the quantum numbers n, j, j;, 7 
(singlet) or n, j,j3, 7 (triplet) of the helium atom. 

We would like to express the matrix element 


<Pone|T |Pana> (1.6) 
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of the transition operator T with respect to the momentum basis (1.5) in 
terms of the matrix elements ‘of T with respect to the angular-momentum 
basis (1.3a). To do this we expand |p 7 in terms of the angular-momentum 
basis, 

Ipn> =, [o,(E) aE1E Us m><E Ua nip n> (17) 

U3 

The determination of the transition coefficients <py|E/l; > = 
Ell, n|pn>* is a problem similar to the determination of the wave 
functions <x|n1m) of the hydrogen atom, which was discussed in Section 
VII.3. Instead of (VII.3.5) we have the analogue 


1 a 
<pal|Lily> = — | GimPi 5p <P O,%rnlW>, (1.8) 


from which follow the analogue of (VII.3.6,) and (VII.3.6’). Here (p 6, ¢,) 
are the spherical coordinates of the momentum vector p. Instead of the 
“radial equation” (VII.3.7) we have 


p? 
<p obo ag a Km )IET n> = E¢p 0,¢,n\E lls n>, (19) 


which is a consequence of the (nonrelativisitic) relation (cf. XIV.5.31) 
PA 


~ 2M(n°") 


between the energy operator K, the momentum P, and the internal energy 
Ki". Although the point has not been raised previously, the mass of the 
particle may depend upon the internal quantum numbers 1 of the system; 
this dependence allows us to consider cases where the outgoing particle is 
different from the incoming particle. From (1.9) and the analogue of 
(VII.3.6’) and (VII.3.63) it follows that the transition coefficients have the 
form [cf. (VII.3.11)] 


KG?) (1.10) 


2 


P int a 
2m, E ¥u,(0,.,) 11) 


The function f,(p) is determined (up to a phase factor) by the normalization 
(XIV.2.9a) of the generalized K-eigenvectors | E/1, 7>, for 


pA E)- 16(E = E’)6 yy 6151, = Ell, n|E'l'l, 


<P Opp nlE lls n> = (0) E = 


= [rc Us npn cen I'l n> 


2 
= |e sin Os dp dé, do, fF (P) é (e = Le ~ a) 
2m, 


2m 


n 


= mPAE)| f(Pa EOE — EV) di51,- (1.12) 


7) 
x Yii (Ce P p) fn(P)O ( = es a Es) Y,1,(9,, Pp) 
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In the last step we have used the fact that if g(x) has only simple zeros x,, 
then 


OS) =X) 
\g'(x;)| 


the quantity p,(E) that appears on the right-hand side of (1.12) is therefore 
the positive zero of the equation 


d(g(x)) = ¥ (1.13) 


PA 
p ; 
Eee Sepa 
ee 0 


oT 
Levis 
p,(E) = ./2m,(E — Ei”). (1.14) 
Equation (1.12), together with 
p,AE) = m, p,(E) (1.15) 
of (XIV.5.18a), gives 
| f,(PAE))| = p(E)~*, (1.16) 


so that with an appropriate choice of phase for the generalized eigenvectors 
|p >, the transition coefficients (1.11) are given by 


2 
<P 9, G,n|E llsn> = (p,(E))~ ale = - = zs Yn,(9,,9,)- (1.17) 


Equation (1.1) tells us that T is a scalar operator; its matrix elements with 
respect to the angular-momentum basis are then [using the Wigner—Eckart 
theorem (V.3.6)] 


CEllsm|/TIE' lly n4> = CU 00|11,><E lm TIE l n> 
as Oy O141,6E ln TIE’ Ina. (1.18) 
Use of (1.7), (1.17), and (1.18) in (1.6) gives 
= ’ : Py int , PA int 
<Pots|TIPatla> = y | de de o(e a )o(e - ee 
x CEln||TWE' lna> Yi ,Q4) Yh, (Q4), (1.19) 


where Q, = p,/|P4| = (04, &4) is the direction of the incident particle and 
Q, = p,/|P»| = (A;, &,) is the direction of the detected particle. If we perform 
the summation )’,, by using the addition theorem (VII.3.22) 


y ¥,,(Q)Y#(Q,) = a5 P(Q, -2,) (1.20) 


Ilg=-l 


for spherical harmonics, then (1.19) becomes 


1 
<Pp Mol T Pana? = a (21 + 1)P(cos 0)<E, ll TIExina>, (2D 
1 
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where @ is the angle between 2, and Q, (cf. Figure XIV.3.1), and where’ 


P4 
Eq = Eats) = 3 + En (1.22a) 
Pp 
E, = E(p,, ™) = Pe + Ein, (1.22b) 


The reduced matrix element <E,/!n,T*||E4/n4> depends upon the 
energy and the internal quantum numbers y, and 77,4, but does not depend 
upon the direction. As one would have expected for a spherically symmetric 
problem in which the initial conditions distinguish only the direction Q,, 
the T-matrix element (1.21) does not depend upon the angle ¢ around the 
direction Q,. 

Equation (1.21) proves the statement preceding and justifying the definition 
(XIV.5.63). Written in terms of the scattering amplitude, (1.21) becomes, for 
ny = na and E, = Ey, 


T(pa, 9) = ¥ (21 + 1)P(cos 0)(—mm)KE gl nal|TEstna> (1.21) 


In the expressions for the cross section [e.g., (XIV.5.38), (XIV.5.50), and 
(XIV.5.60)] and in any other physical quantity, the factor o(E, — E a) 
appears as a consequence of energy conservation [cf. (XV.3.13)]. Con- 
sequently only the reduced matrix elements 


<n T(E a lina> = (Ey = Ea! m||T Esl nad (1.23) 
on the energy shell E, = E, have physical meaning. Energy conservation 
for the nonrelativistic case is given by 

pi /2m, + Ett = E, = E, = pi/2m, + Ee. (1.24) 


NA Nb 


In nonrelativistic coflision experiments one usually has m, = m,, but does 
not have Ei = Ei" except in the case of elastic scattering. In the following 
we shall examine in detail the case in which the detected particle is the same 
as the incident particle, so that m, = m,; first, however, we give the expres- 
sions for the cross section in the general (spinless) case. 

The differential cross section per unit solid angle for scattering from an 
initial state with momentum p, and internal quantum numbers 7, into a 
final state with internal quantum numbers n, is [cf. (XTV.5.60) or (XTV.5.61') 
at the end of Section XIV.4] 


do” m,zM, 


io = Cn)" | Po(Es) dE! EO onol T PaOaPattad? 
b Pa 


xO EE). (1.25) 


' Note the change in notation: E , here denotes the total energy, whereas in Section XIV.5 E4 
denoted the projectile (kinetic) energy. 
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By using (1.21) we may rewrite do”/dQ, in terms of the reduced matrix 
elements (1.23): 


do" 2 m4m, 
St 
dQ, 


(cos 6)<ny|| T(E s)\Ina>|?. (1.26) 


The dependence of the momentum p, = p,(E,, n,) of the detected particle on 


E,(=£,) and n, is 
PAE 4) = ./2m,(E, — Ei), (1.27) 


as determined by (1.24) or (1.14). 

The differential cross section given by (1.26) may be expressed as a series 
in the Legendre polynomials P,, but the result is complicated (this will be 
done in Section XVII.8, where it is applied to the phase shift analysis). We 
therefore give here only the expression for the total cross section, which may 
be expressed simply in terms of the reduced matrix elements. By using the 
orthogonality property 


[Por =a 545 (1.28) 


of the Legendre polynomials one obtains 
do™ 
m = |dQ, — 
id =| > 70, 


= 4m? 4 (E,) Y(2 + Dini TEDta>? (1.29) 


for the total cross section. This suggests that we define the /th partial cross 
section 


3 N4m, 


of = n> —*— p,(E,) (21 + D1 <nel TEs inal? (1.30) 


and that we write the total cross section as 
Gi) ope (1.31) 
t 


af’ and o” in (1.30) and (1.31) are the partial and total cross sections, 
respectively, for scattering from a state with internal quantum numbers 
7 =n, into a state with internal quantum numbers y = n,. Often the 
detector registers not just those states with a well-defined value ny, of 1 
but all states with values of 7 within some range. This happens, for example, 
if 7 are the internal quantum numbers of the target and the detector registers 
scattered projectiles with any possible energy. The projectile may excite the 
target into any internal state 4 by losing the corresponding amount of 
kinetic energy, and the detector cannot distinguish between elastically 
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scattered projectiles and inelastically scattered projectiles. In such a case the 
Ith partial cross section is given by 


= Loft = 4? Al + DY EM wl TEolnar?, (132) 


Nb A Nb 


and the total cross section is given by 


C=) Gg? =) 0, (1.33) 
Nb I nol 

Instead of the reduced matrix elements <1,|| T(E 4)||N4> it is convenient 
to use the “partial-wave amplitudes.” They are commonly used in nonrela- 
tivistic scattering theory (particularly in the case of elastic scattering), and 
formulas written in terms of them can be immediately taken over to the 
relativistic theory. One proceeds from the assumption that the initial state 
is fixed by the internal quantum numbers n,, which usually describes the 
state of lowest energy. From the reduced matrix elements one defines the 
partial-wave amplitudes by” 


TRACE 4) = T?(p4) = —7./m,m, <np| T(E 4)\ln4>> (1.34) 


where p, is given in terms of E, and n, by (1.22a) and in terms of py, Eee 
and Ei" by (1.24). No additional quantum numbers », are required to de- 


scribe elastic scattering, for which y, = n4, and one simply writes® 


T(p4) = a (Da) = —T/M,mM, <nall T(E 4)\\n4>- (1.35) 


The partial-wave amplitude T7*(p,) for the inelastic case (7, #4) 1s 
often called the partial-wave reaction amplitude, in distinction to the partial- 
wave amplitude T,(p,4) for elastic scattering. In terms of the partial-wave 
amplitudes the differential cross section (1.26) is written 

do” _ PrlEa) q 


ON ae Bg laces O)T?*(p)| ; (1.36) 


while the /th partial cross section (1.30) is written 
E 
oT = 4n ee (21 + 1)|T@(p,)|?. (1.37) 
A 
The /th partial cross section (1.32) for elastic scattering is then 
o,(elastic) = of4 = 4n(21 + 1)| T(p,)|’, (1.38) 
and the /th partial cross section for inelastic scattering is 


: p an 
g,Gnelastic) = > of a +1) ¥ pkEOIT Pew. (39) 
A 


No eNA NbtNA 


2 See footnote 3 below. 


E We shall suppress the lable y, designating the internal properties of the intial state, when- 
ever this does not obfuscate the notation. 
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The final momentum p,(E,) that appears in the above equations may be 
expressed in terms of the initial momentum and the difference between the 
initial and final internal energies by 


P(E4)"/2m, = pa/2m, — (E he 15) (1.40) 
The S-matrix is given in terms of the (on-the-energy-shell) T-matrix by* 
CE, 113 my |S| Eqs n> = (Ey lls ml Egl nad 
= 2710(E, — E,)<E,li,n,|T|E,l in. GA) 


Spherical symmetry, which was initially stated by (1.1), is equivalently stated 
by 


[S, L,] = 0. (1.42) 


S is thus a scalar operator and by the Wigner-Eckart theorem has matrix 
elements of the form 


CE, 11; 4,|S|E4 lly 44> = On Onn, CE, 1 my |S\|E 4! n>. (1.43) 
From energy conservation [Equation (XV.3.13)], 
so) (1.44) 
it follows that 
CE, ly |SEsina> = OE, — E4)p (Es) t Pulea) 2m S(E4)llna>> 
(1.45) 


where p,(E) is given by (1.14) and (1.15). The factoring off of the weight 
functions p,''? and p,‘? is a convention made in the definition of 
<y,iS(E,4)\in4> [but not in definition of <7,\| T(E4)||/n4] in order to simplify 
later expressions (e.g., the unitarity condition in terms of the S- and T- 
matrices, which then can be carried over directly to the relativistic case). 
After substitution of (1.18) and (1.23) and of (1.43) and (1.45), the connection 
(1.41) between the reduced matrix elements of S and T becomes 


<nollS(Ea)itta> = Onna — 2tin/ PEP aE a)<noll Ean a> (1.46) 
In terms of the partial-wave amplitudes this connection is given by 
S(E4) = <nalS(Ea)\lna> = 1 + 2ip, Ti(pa) (1.47) 


for elastic scattering and by 


Km S(Slln4> = 2i/ P(E a)Pa T?(Pa) (m, # Na) (1.48) 


for inelastic reaction processes. 


4 The S- and T-operators were discussed in more detail in Section XV.3, and (1.41) is identical 
with (XV.3.36). If Chapter XV was skipped in the first reading. the reader may take (1.41) as the 
definition of the S-matrix. 
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XVI.2 Unitarity and Phase Shifts 


In the previous section we expressed the cross section in terms of the transition 
matrix and also in terms of the partial-wave amplitudes [Equations (1.25) 
and (1.36)]. The only condition used was the assumption (1.1) of spherical 
symmetry. This allowed us to write the cross section in terms of some known 
functions of the scattering angle and the partial-wave amplitudes. For a 
given scattering problem the partial-wave amplitudes can be determined 
once the potential is known, in the same way that the scattering amplitude 
can be determined from the potential, by solving the integral equation 
(XV.3.37). There is, however a general condition that permits us to obtain 
some general properties of the scattering amplitude. This is the unitarity 
of the S-operator, which is an expression of the conservation of overall 
probability and which is a consequence (for the case that time development 
is described by a Hamiltonian, as specified by the axiom V) of the Hermiticity 
of the Hamiltonian. 
The unitarity of the scattering operator, 


Sis =a) (and SS' = I), Qi 
is written in the angular-momentum basis as 


S Prl Ep) dE,<E l L nl S*\E, 113 ny> CE, L ls m|S\ Eat 13 14> 


lym, 
= (E11, AEA ty nad = pA Es) OE — Ey)5ir 55, 55q4- (2.2) 


Using (1.43) and (1.45), this leads to the unitarity condition for the reduced 
S-matrix elements: 


ay Cm SCE A>* <n SKE aina> = Oana: (2.3) 
Nb ‘. 
For the special case 7 = 44 Equation (2.3) may be written as 
SKE? + Y inl S(Edlina> ? = 1, (2.4) 
Nb FMA 


where _ extends over all possible values of n, except n, = n4. Inserting 
(1.47) and (1.48) into (2.4) leads to the unitarity condition expressed in 
terms of the partial-wave amplitudes: 


Im T,(p4) = » PAE 4)| TP(pa)l? 


= palTi(pal? + » p(EalTI(pa)l?. (2.5) 

NbFNA 
Equation (2.5) is the partial-wave amplitude form of the “ generalized optical 
theorem” (XV.3.41). The first term on the right-hand side corresponds to 
elastic scattering ; the second term corresponds to inelastic reaction processes. 
For a particular collision experiment the values of y, over which the sum 
in (2.3) and (2.5) extends may be limited to those permitted by the initial 
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energy. If the kinetic energy of the projectile is lower than the energy necessary 
to excite the target, ie., if 


pa/2m, < Ee ms aA (ny ~ 14) 


(for an “infinitely heavy” target that remains always at rest), then the pro- 
jectile can only be scattered elastically, and all <n l|S(E4)\ln4> vanish for 
N» # N4- One says that only the “elastic channel” is open. For instance, if 
in e — H scattering the kinetic energy p*/2m of the incident electron is lower 
than the energy difference between the ground state and the first excited 
state, E(n = 1) — E(n = 2), then only elastic scattering 


e+H-e+H 


is energetically possible, so only the elastic channel is open. As the kinetic 
energy of the incident beam is increased, one reaches a point above which 
it is energetically possible to raise the target into the first excited state n, 
with the projectile losing a corresponding amount of kinetic energy. This 
point is the threshold of an inelastic process. The threshold momentum for 
an infinitely heavy target is given by 


Ps — int int 
Pthresh/2M 4 — Ey at EY : 


Above this value for p,, both <yy4|/S(E4)\ln4> and <y,||S(E,)\ln4> are 
different from zero. One says that an “inelastic channel” has opened up. 
As the incident energy is further increased, other inelastic processes may 
become possible, and an increasing number of reduced S-matrix elements 
in (2.4) become nonzero. 

There can be different kinds of inelastic channels. The simplest are the 
excitation channels, in which the internal state of the projectile (target) 
remains unchanged whereas the target (projectile) performs a transition 
from the initial state into an excited state. 7, and n, then refer to the dif- 
ferent internal quantum numbers of the target (projectile). (If the projectile 
is also a system with internal structure and its internal quantum numbers are 
contained in the set 7, then these quantum numbers will remain fixed.) In 
the example of e-H scattering the excitation channels are the final states of 
the process 


e+ Hin = 1)>e + H*(n = 2). 


The threshold energy for this process is| E(n = 1) — E(n = 2)|. Ifthe incident 
energy is increased further, the target may ionize, 1e., the target may be 
raised to the energy at which it breaks up into two particles; one then says 
that the “ionization channel” has opened up. For e-H scattering the ioniza- 
tion channel is the final state of the process 


e+H(in=1)>e+e+H"* 


and the ionization threshold is |E(n = 1) — E(n> x)| = 13.53 eV. 
The internal quantum numbers ,, or at least a subset of them, now also 
extend over a continuous set of values. In these cases it is often more practical 
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to use for the postcollision system a different set.of quantum numbers than 
for the precollision system. This becomes even more necessary for more 
complicated collision processes like rearrangement collision, in which the 
projectile is absorbed by the target and another particle is emitted. A simple 
example of such a process in atomic physics is 


y + He > He* +e. 


For more complicated precollision systems the number of possible scattering 
channels increases—e.g., in the collision of He atoms by protons, where the 
following channels are possible: 


Ht + He > H* + He (elastic channel) 
— H* + He* (excitation channels) 
+ H+ + He* +e (ionization channel) 
+ H+ + Het* +e (ionization-excitation channels) 
— H + He* (rearrangement channel) 
— H* + He* (rearrangement-excitation channels) 


Many-channel processes occur also in nuclear physics and in particle 
physics, and it is in these areas that many-channel problems become particu- 
larly important. For instance in pion-proton (7-p) scattering one has (among 
others) the following channels: 


Tt aD > dy ta (elastic channel) 
on +n°+p 
>Hy+n 
+K°+A 
Se Behe 


(n and K° denote mesons heavier than x; n and A denote the neutron and the 
“strange” baryon A, which is heavier than the proton and the neutron; 
p* denotes various excited states of the proton.) 

All scattering channels (i.e., the elastic channel and all inelastic channels) 
are simultaneously open if the initial energy is high enough. These scattering 
channels are various final states, differing in the internal quantum numbers 
n,, and the sums in (2.4) and (2.5) must be extended over all channels that 
are open, i.e., energetically possible. The interaction can, of course, be such 
that an open channel, say 4;, does not contribute much to a particular 
partial-wave amplitude, ie., that |<n5||S,CE4)ii74>| is small or zero even for 
energies above the threshold for the channel nj. These are detailed properties 
of the interaction (“the dynamics”), which require further information. 

We initially assume that all inelastic channels are closed. Equation (2.4) 
then becomes 


|S(E,)|? = 1, (2.6) 
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For elastic scattering it thus follows that S,E,) is a function of E, (or, 
equivalently, of p4) of modulus 1 and consequently may be written 


DiEapetewe (2.7) 


6,(E4) is a real-valued function defined up to an integral multiple of x by 
(2.7). 6(E 4) is called the scattering phase shift, for reasons we will meet later. 

The elastic partial-wave amplitude that appears in (1.47) may also be 
expressed in terms of the phase shift, 


Spa) — 1 ; 
T, SES a 2idi(p) —S 
(Pa) 2ip, ip, (e 1) 


ee 
= — e%P4) sin 5,(p4) = 1 tan 0(p,) 


Da pa 1 — itan 6(p,) 
4 1 
Pa cot 0(p4) — i 


which in turn allows us to express the /th partial elastic cross section of (1.38) 
in terms of the phase shift: 


(2.8) 


ee 
o(elastic) = 3 2 + 1) sin? 5,(p,). = (29) 
A 


We would like to extend the phase-shift formalism to include the inelastic 
or reaction channels. If the sum over the reaction channels in (2.4) is nonzero 
(i.e., if E, is sufficiently high that some of the reaction channels are open), 
then S,(E,) can no longer be written as in (2.7) with real 6,(E,); but it may be 
parametrized in the form® 


S(E4) = n(E,e?"@) (2.10) 
with real 6,(E,) and with real y,(E 4) satisfying 
0<n(E,) < 1. (2.11) 
That n, satisfies (2.11) may be seen by inserting (2.10) into (2.4), 


n(Eay =1— Yo KXnllS(Eadlina>l?, (2.12) 


NbFNA 
and observing that 


Y <nolS(Ea)iina> 1? 


NbtNA 


5 The n, that appears in (2.10) is not to be confused with our use of 7 to denote the internal 
quantum numbers. We use it here because this notation of 6, and 1, 1s often used in particle 
physics: in other areas of physics the notation may be different ¢.g., in atomic physics 1, is 
often the symbol for the phase shift denoted by 6, here. 
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cannot be larger than 1. Equation (2.12) can be written in terms of the partial- 
wave reaction amplitudes by using (1.48): 


n{E4)? =1—4pa > pE4)| TP(pa)l. (2.13) 


NoFNA 


The partial-wave amplitude for the elastic channel may be expressed in 
terms of 5,(p4) and n(p4) as [cf. (1.47)] 


Si(p4) — 1 _ n(pae =|! 


T(pa) = 2ip, ‘ 2ip, 
_ Mpa) sin 26(p4) , 1 — pa) cos 20Pa) (9 1 
2DA 2P A 


From (1.38) and (2.14) it then follows that the /th partial elastic cross section 
is 


o(elastic) = Bl + 1)(1 + mp4)” — 2n{p.4) cos 26(p4)), (2.15) 
A 


while from (1.39) and (2.13) we have 
o,(inelastic) = Ol + 1)(1 — n(p4)2) (2.16) 
A 


for the /th partial inelastic cross section. The total /th partial cross section 
is the sum of these: 


o, = o(total) = = + 1)(1 — ,(p,) cos 26,(p,)). (2.17) 


For n, = 1 both (2.15) and (2.17) are identical with (2.9), which gives a, in 
the case that all inelastic channels are closed, and (2.16) vanishes. From this 
and from (2.10) and (2.11) one sees that 7, is a quantitative characterization 
of the amount of inelasticity involved in the scattering; , is therefore called 
the inelasticity coefficient. 

The total /th partial cross section may also be expressed in terms of the 
Ith elastic partial-wave amplitude by adding (1.38) and (1.39) and using the 
generalized optical theorem (2.5) to get 


4 
os = (21 + 1) Im T\(p,). (2.18) 
A 


If we sum (2.18) over all / and use the imaginary part at 6 = 0 of the 
elastic-scattering amplitude 


T(pa, 9) = > (21 + 1)P\(cos 0)T\(p4), Qils) 


we then obtain 


4n 4 
6= a= ¥ 21 + 1) Im T(p,) = = Im T(p4,9 = 9). (2.20) 


A 


Argand Diagrams 369 


{Also used was the property P,(1) = 1.] Equation (2.20), which connects 
the total cross section and the imaginary part of the forward-scattering 
amplitude, is usually called the optical theorem. 


XVI.3 Argand Diagrams 


In order to familiarize ourselves with the properties of the scattering 
amplitude and its behavior under certain conditions we consider a graphical 
representation in which p,7,(p,4) is regarded as a vector in the complex 
plane. These graphical representations are called Argand diagrams and play 
a particularly important role in the description and detection of resonances. 

We start from the expression (2.14) for the elastic partial wave amplitude, 
which is written in the form 


i i ; 
PaT\(pa) — Bee yi(p,je es), G.1) 


For the case of purely elastic scattering (e.g., in the region of low momentum 
where no inelastic channels are open) y,(p4) = 1, so pyT\(p,) — i/2 is a 
complex number that lies on a circle of radius 1/2. If 6,(p4) varies between 
0 < 6, < x(modulo 2) when p, is varied over the range in which only elastic 
scattering occurs, then all points of this circle are covered. In general, how- 
ever, 0(p,) will not vary over all of the interval 0 < 6, < x, and hence 
palp,4) — 1/2 will not vary over the full circle as p, is varied over that range 
in which only elastic scattering is possible. 
The values of 


i jee 
PaT((pa) = 5 ) gone) [n(pa) = 1] (3.2) 


lie on a circle with center at i/2 and radius 1/2. This is in agreement with the 
optical theorem (2.18), according to which Im T\(p4) = 0. The circle around 
i/2 with radius 1/2 is called the unitary circle. 

If there is absorption of energy by inelastic processes, then 


Pal pa) = : am 5 n(pajer tr) [n(pa) < 1], (3.3) 


and the value of p, 7;(p4) moves on a path inside the unitary circle as p, is 
varied. Figure 3.1 shows the graphical representation (Argand diagram) of 
paTpa). One calls the circle the “unitary circle” because, as a consequence 
of the unitary condition, which is expressed by (3.3), it follows that the path 
of p47(p4) must lie inside this circle. The exact path followed depends upon 
the particular interaction. The most important case of these diagrams is the 
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Im p4T, 


Unitary circle 


Re p,T, 


-4 0 +5 


Figure 3.1 The elastic-scattering amplitude p47,(p.4) in the complex 
plane. (Argand diagram). 


case in which the interaction between projectile and target is resonant. We 
will consider these cases below in Section XVIII.8. 

Argand diagrams may also be drawn for the partial-wave amplitude 
T’(p,4) of an inelastic channel (yn, # n4). One expresses the partial-wave 
amplitude T7°(p,) [or the reduced S-matrix element <y,||S,(E4)\\n4> that 
corresponds to it by (1.48)] in a form analogous to iG 


1 1 ss 
V PolEa)Pa TIP a) = 5 (ml SCE) Na> = oF nipe'?*". (34) 


Unitarity of S ((2.3) or.(2.4)) imposes the restrictions 


0 <ni(p4) < 1 (3.5a) 
and 
n(pay + ¥ ni(pay = 1. (3.5b) 
NbFNA 


In the Argand diagram, ./p,(E4)p477°(p4) then moves as a function of p4 
on a path that lies inside the circle with center at the origin and radius 1/2 
(cf. Figure 3.2). (The path would lie on the circle if there were neither elastic 
scattering nor inelastic scattering into any channel other than the one with 
quantum numbers n,.) 


© »,.7 4 denote the internal quantum numbers (channels). 7? denotes the inelasticity coefficient 
for channel n, as in Equation (3.4). 
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Imx 


Re a 


Figure 3.2 The amplitude x = ./p,(E4)p47/"(p,) for scattering from 
channel 7, to an inelastic channel 7,. In this case the center of the circle 
is at the origin and the amplitude is restricted to ||a| < 4. 


Problem 


1. (a) Show that the Born approximation violates unitarity. 
(b) Where inthe Argand diagram for elastic scattering can be Born approxima- 
tion be considered a good approximation? 


CHAPTER XVII 


Free and Exact Radial Wave 
Functions 


The present chapter treats the spatial properties of the scattering process as 
described by the wave function. After an introduction in Section XVII.2 the 
differential equation for the radial wave function (the Schrodinger equation) 
is derived. Section XVIJ.3 presents the solutions of the free radial wave 
equation, and lists some of their properties. The properties of the exact 
radial wave functions, in particular their asymptotic forms and their con- 
nection to the phase shifts and the S-matrix, are discussed in Section XVIL4. 
In Section XVIL5 the connection between bound states and poles of the 
S-matrix on the positive imaginary axis is established. Some material about 
functions of a complex variable, which is needed in this chapter and in 
Chapter XVIII, is reviewed in a mathematical appendix. 


XVII.1 Introduction 


In the preceding sections we discussed those properties of the scattering 
amplitude and the cross section that were consequences of very general 
assumptions like unitarity and spherical symmetry. We derived relations 
between the phase shifts, the inelasticity coefficients, the partial-wave am- 
plitudes, and the /th partial S-matrix elements. The cross section was ex- 
pressed in terms of these quantities. Nowhere in our discussion did we 
consider the spatial properties of the scattering process; in particular, we 
did not have to make use of position-dependent quantities such as the 
probability wave function. 
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The probability wave function, however, serves to provide a pictorial 
representation of the scattering process. In fact, it is in scattering theory 
that the wave function has a direct relation to observation, since it represents 
the probability for position measurements that can actually be performed. 
(For stationary states such measurements are not performed in practice, 
as has been discussed in the first half of the book; the wave function then is 
merely an auxiliary mathematical quantity.) 

In the present chapter we introduce the wave function. As in the previous 
chapters, we consider two different points of view: Firstly, we assume time 
development of the system as given by a Hamiltonian H = K + V. This will 
provide a “deeper” foundation for the relations between the more directly 
accessible quantities. We then see, however, that these relations have a more 
general validity. 

In addition to providing us with the wave function, this chapter will also 
establish the connection between our more general coordinate-free formula- 
tion of scattering theory and the usual formulation in terms of solutions of 
the Schrodinger wave equation. This latter formulation is directly obtained 
if the internal quantum numbers n are ignored. 

Though the Schrédinger differential equation constitutes a powerful tool 
in many cases (described by an interaction potential), it will turn out that 
the integral equation for the wave function (which is, in fact, just the Lippman— 
Schwinger equation in coordinate form) is of more immediate use. 


XVII.2 The Radial Wave Equation 


We start with the Lippman—Schwinger equation in the angular momentum 
basis 


| Ell;n* > = |Ellgn> + V | Ells" >. 


i 
E-—K+i0 
It is assumed that the position operator for projectile relative to target, Q, 
commutes with the internal observables n°" of the system: 


[O;, °°] = 0. 
By taking the scalar product of the Lippman-Schwinger equation with the 
generalized position eigenvector 
[x n> = |x1X2Xx3n') = |rOGn’>, 
we then get 


<rO4pn'|Ellsn "> = <rOdn! [Ellyn + <rO6nt | z-— pe VIEllsn* >. aN) 
where <r0on'|Ell,n> and <rO@n'|Ell;n*> are called the free and exact 
wave functions, respectively. 
After scattering, the projectile’s position x and momentum p have the same 
direction (see Figure XIV.3.1): 
x/|x| = p/|pl. (2.2) 
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Thus (0, ¢) are the angular coordinates of x as well as of p. On account of 
spherical symmetry one can write, using the same arguments that led to 
Equation (VII.3.11), 


Cron’ |Ellsn> = <O0|Il3> <rn'|En>, 


aa 19D) Ou <t | EDT, (2.3a) 
<r0gn'|Elln* > = <O$|Ils><rn'|En* >, 
= Y,,(0)>rn'|En* >i. (2.3b) 


The reduced matrix elements ./1/2<r| E>? and ./x/2<rn'|En* >, are called 
the free and exact radial wave functions respectively. 
Using the Wigner-Eckart theorem for the scalar operator V, one obtains 


<rOpn'|V|Elln*> = ¥ <rOgn' | Ellsfi* >< Ellyi* | V| Ellsn* > 


Ell3h 


= » <rOgn’ | Ell,q* > 6.61,7,¢E* |V|En* >, 


Elian 
=2 hi, &)<rn' | E* > <ER* || VIIEn* >, 
ji 
where <Ef* ||V||En* >, is the reduced matrix element of V. Thus 
<rOgn'|V|Ellsn*> = Yu,(0, 6)<rn'||V En >1, (2.3c’) 


where we have defined the reduced transition matrix element <rn’||V||En* >, 
in terms of the reduced matrix elements of V by 


<rn'\|VIEn* >, = ¥ <rn' |Efi* >< En* || VIER". 
En 


The reduced transition matrix element is a function of I, r, E, 4’, and only. 
For spherically symmetric problems, V is an operator function of the radius 
operator Q = (Q’)'’”. In general it also depends upon the internal ob- 
servables, i.e., upon the operators that change the internal quantum num- 
bers 7. 

We use the fact that [Q, V] = 0 to write 


Cn lV Fn = lA aIIVOINMD, 


where <r|7> is the 6-function with respect to the continuous summation 
over F and <n'|||V(r)|||7> is a doubly reduced matrix element depending only 
on r, 7, and n’.? Therefore 


Crn'||VIIEn* >, = Xo <rn'V li) FH En* >, 


= 21 Cn'IIVOIUAD FA En* >, 


' <n’ |En*>, cannot, in general, be written as <r|E*)>76,,, because [H, °°] = 0 may not 
be true. Thus in general | Ey ~ > may not be an eigenvector of y°?. Only if [H, 7°? ] = [V. n°] = 0 
is this true. Cf. the remark following Equation (XVI.1.3). 


> Applying the Wigner Eckart theorem for the group generated by @Q to the scalar operator 
with respect to this group, V. 
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Performing the summation over 7 and inserting the result, 
<rn'VILEn* >, = 2 <n'IVO@NlliD <rA|En*>,, 
n 
into (2.3c’), one obtains 


<rOpn'|V|Ellsn™ > = Yu,(8, >) d, <n'IV@INA> <r] En*>,. (2.30) 


Using the same arguments, one can also show that 


1 1 
(tp | ar = 1) | ee. + 
(; on lp: rae Elly) (9, oy(m = cea ren , 


(2.3d) 


We shall now use the results (2.3) to obtain the Schrédinger differential 
equation for the radial wave function. In Section XVII.3 we shall consider 
the differential equation for the free radial wave function. Then, from the 
Lippman-Schwinger equation (2.1) for the wave function, we shall obtain, 
in Section XVII.4, the integral equation for the radial wave function. The 
advantage of the integral equation over the differential equation is that the 
former incorporates the boundary conditions; in this particular case the 
boundary conditions appropriate to the Lippman-Schwinger equation for 
|Ea*> are those for a free incident state, as discussed in Section XV.2. 

We assume that the incident beam has projectiles of one kind, whose 
mass m is not changed by the scattering process. The differential equation 
is then obtained by taking the transition matrix element of the operator 
H = K + V= P?/2m + Ki"(n°”) + V, where P is the projectile momentum 
operator: 


E<r6on'| Ell,n* > = <r0gn'| A | Ell;n*> 
= <rOgn'|(P?/2m + Ki + V)|Ellgn*> 
lieved fiend i + 1) 
= wD pnt 
x (= ra ) ei aa adi 
x Cron’ |Ellsn*> + <rObn'|V\Ellsn*>  — (2.4) 
{The second equality follows from Equation (VII.2.6) and (VII.3.4). ] 


Using (2.3a, b, c) in (2.4), we obtain a system of coupled differential equa- 
tions: 


ie ca ' 
2 {5 dr ( +) — ? eae bi 


fi 


amc CO <rii|En™ >, = 0, (2.5) 


where 


P(E) = ./2m(E — E'y"'). (2.6) 
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In order to solve this system of equations one has to known the reduced 
matrix elements of the potential <y'|||V()Illy>. These depend, of course, 
upon the particular process under investigation. For instance, in the many- 
channel problem of the elastic and inelastic scattering of electrons by atoms 
these reduced matrix elements can be determined from the Coulomb inter- 
action and the atomic wave function.° 

We shall not discuss the many-channel problem here any further* but 
restrict ourselves to the case 

[¥, n°] = 0. (2.7) 


Then the problem reduces to a one-channel problem for each value of 
(i.e., for each set of values of the internal quantum numbers). To perform 
the reduction, we use the above arguments to write 


Cn WV OAD = Syn Vi). (2.8) 
Since we now have [H, n°?] = 0, we can also write 
Cry |En* >, = OyySt E> D1. (2.9) 


Under these conditions (1.5) goes over into a set of uncoupled equations, 
one for each channel 7: 
1d d Ki + 1) 
E ay (P<) aes & —2mV,(r) te PAB? | Gib =) (2.10) 
In the following sections we shall discuss the properties of the radial wave 
functions <r|E*>?. Though V,(r) may vary with n, and p, may differ from 
2mE = p? by different constants 2mE;", the label 7 is inessential, as it does 
not change in the scattering process. We shall therefore suppress it in most 
of what follows, reinstating it at the end to emphasize the fact that further 
quantum numbers may be present. 


XVII.3 The Free Radial Wave Function 


The equation for the free radial wave function <r| E>, is obtained from the 
eigenvalue equation of K: 


E<rOon'| Ell3n> = <r0pn'| K| Ell3n> 


in the same way as (2.5) is obtained from (2.4). It differs from (2.5) only in 
that the term in V is missing and that since [K, n°? ] = 0, 


<rii|En>, = oa,<r| EDT. (3.1) 
We thus have 


l dyed ead 
wal" $] -4 : + wheres =o (3.2) 


r 


> For the scattering of electrons by hydrogen atoms this is discussed in Massey et al. (1969, 
Vol. 1, Section 7.1), A more general case is treated in Smith (1971, Section 2.1). 


* We shall return to the many-channel problem in Chapter XX. 
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where p = p,(E) = ,/2m(E — E'"'). We shall write <r| E>? = <r| p>}. Equa- 
tion (3.2) is a well-known equation, which has as its linearly independent 
solutions the spherical Bessel and Neumann functions. The solution that 
is regular at r = 0, to which <r|p), is proportional, is the (proper) spherical 
Bessel function (spherical Bessel function of the first kind, or Riccati—Bessel 
function):> 


n\1/2 
J(pr) = j(z) = (Z) © Sis12(2) 


ney (: al (= ‘) (33) 
z dz Zz 


ERS - (—z?/2)" = 
= ST. Seer er pr ae 
nzo N(21 + 2n + 1)!! 
and the solution irregular at r = 0 is the spherical Neumann function (spheri- 
cal Bessel function of the second kind or Riccati- Neumann function), 


1/2 
n(z) = —(-1) (=) J-1-122) 


I 
= —(—z)' (: +) (= ‘) a 
zdz Zz 
ee oe en — 1)! 
= > ee 


Here J; 1;2(z) and J_,_,,.(z) are ordinary Bessel functions of the first kind. 
The spherical Hankel functions, defined by 


hz) = jz) + in(z), (3.5a) 
hf(z) = j(z) — in(z), (3.5b) 


are then also independent solutions of (3.2). We note for later use that all the 
above relations also hold for complex values of z. 
The asymptotic behavior of j,(z), nz), and h,(z) for real z > 00 is 


ie In 
jz) ce > sin (: = *). (3.6a) 
n(z) ~ — * cos (: -- 3), (3.6b) 
art 
h{z) ~ es e™ (3.6c) 


5 The double factorial is defined by 


n(n — 2)-++(5)(3)(1) if n is odd, 
n(n — 2)---(4)(2) if n is even. 


nit= 
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For future reference we note the bounds for complex values of z: 


|z| opal Poe 

|j(z)| < const esr pie (3.6d) 
|z| ci —-Imz 

|h,(z)| < const eae e : (3.6e) 


Near the origin we have 


z! 


ee EY 
n(z) > [—(2'— 1)!z7'?. (3.7b) 
z>+0 
The normalization of the j,’s is 
i T 
|? ar orton) = 35 40 - P?, (3.8) 
0 2p 
fe p> dp jpr)j(pr’) = 5a OF =F): (3.8b) 


The spherical Neumann function cannot be delta-function normalized 
in this way. All spherical Bessel functions j,(z), n,(z), h(z), and hf(z) fulfill 
the functional equation 


d 
no 7 ee a ne) (3.9a) 


and the following equality for the indefinite integral: 


fr dr fi(pr) fi(p'r) = —s— eer Lesi(or) fi -1(p'r) — pfi-.(pr) fF ry], 


(3.9b) 


of which (3.8) is a consequence. 

To find the proportionality factor between <r|p>, and j,(pr), we calculate 
the normalization integral of the free radial wave function <r|p)>,. In spheri- 
cal coordinates the 6-function normalization of the generalized position 
eigenvectors reads 


(x|x) = 2 = 9 d(r — r')6(6 — 8')6(o — ¢'); (3.10) 
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and comparison with 


<x |x") =) | p(E) dE <r09| Ell; <Ell;|r'0'') 


lls 


=), | p(E) dE ¥,,(6, 6)<rl E> YH, 6')<r' EDF 


U3 
1 
= Fp 10 — 5b ~ 4°) | pC) dE <rIEDX<r|EDH, 
where we have used 


1 
» ¥i,(9, O)Vi,(8, 0’) = a OG — 0 Xp ee (VIL 21) 
shows that the normalization of the radial wave function is 


[oCe) dE <r EW IEE = Or G.11) 


or, in terms of p, 


[o? dpsrip>.cr'Ipyt == 50 — ¥) G.11’) 


As <r|p>, is a solution of (3.2) regular at r = 0, and therefore proportional 
to j,(pr), one obtains by comparison of (3.11’) with (3.8b) that 


<r| p>. = ./2/nj,(pr). (3.12) 


XVII.4 The Exact Radial Wave Function 


The precise properties of the exact radial wave functions <r|E*>, depend 
upon the interaction Hamiltonian V, through the interaction potentials 
V(r) = <xn|V|xn>. However, provided V fulfills some very general con- 
ditions, one can make some general statements about <r|E*)>, without 
knowing the detailed form of V. In our cavalier treatment of the mathematical 
aspects these conditions have never been precisely stated, but their content 
is roughly as follows: 


1. The interaction is not effective for large projectile-target separation, 
i.e., V(r) > 0 as r + «©. Which precise assumption one has to make 
for the rate of falling off V depends on what one wants to achieve: 
Usually one assumes that V(r) falls off at least as fast as 1/r°. For the 
derivation of some properties one has to assume that V(r) decreases 
faster than any power of I/r, and often one assumes a finite range for V. 

2. The interaction does not change too abruptly as the projectile ap- 
proaches the target: V(r) is a continuous function of r except at a 
finite number of finite discontinuities. 


380 Free and Exact Radial Wave Functions 


3. There exists a lowest energy value (since the binding energy must be 
finite) or in other words the spectrum of H has a lower bound. In terms 
of the potential this means that V(r) is not too singular at the origin 
r = 0 (less singular than r~ *’? if V(r) < 0). 


Almost all physical potentials fulfill these conditions—potentials for 
electrons scattering off an atom, potentials for atom-atom scattering, the 
Yukawa potential, the square-well potential, etc. The Coulomb potential 
does not fulfill condition 1 and therefore requires a separate mathematical 
treatment. In practice, however, this is not of much consequence, because 
electromagnetic screening is always present: One never really has the exact 
Coulomb potential 1/r, but a screened potential Y(r)/r, with Y(r) = 0 for 
large r.° For the following discussion we assume that the above three con- 
ditions are fulfilled in whatever precise form we need. 


If 
V(r) > 0 for all r, (4.1) 
then the potential is said to be repulsive, while if 
V(r) <0 for allr, (4.2) 


then it is said to be attractive.’ If V(r) < 0 in some interval r; < r < rj, then 
V(r) is said to be attractive in that interval. If V(r) is attractive in some 
interval, there may be a certain number of solutions of the differential equa- 
tion (2.10) for a certain number of discrete, negative values of p;/2m = E,. 
These solutions correspond to bound states of the projectile-target system 
with binding energies | E,,|, as we shall discuss in the next section. They must 
fall off sufficiently rapidly as r > oo, in addition to being regular at r = 0. 
The exact radial wave function <r|E*>, = <r|p*>, for p? > 0 is the 
solution of the differential equation (2.10) with certain boundary conditions: 
In addition to being regular (i.e., <r|p* > < 00 at r = 0), one would want, as 
poco, <r|p° >, to represent the /th spherical component of an incident 
plane wave and an outgoing scattered wave, and that for V + 0, <r|p*>, > 


{r|p>, = ./2/nipr), the acceptable solution of the free wave equation. 

Instead of solving (2.10) under these boundary conditions (see Problem 
XVII.1), we shall obtain the exact radial wave function <r|p* >, from the 
Lippman-Schwinger equation (2.1), which has the boundary conditions 
already built in. Inserting (2.3a), (2.3b), (2.3d), (2.8), and (2.9) into (2.1), one 
obtains the integral equation for the radial wave function <r|E~ >,: 


<r|E*>, 


= <rED + | eB) dB r? dr! CE, pee EVIE Dy 


(4.3) 


* Note also that in general any assumed J'() is a mathematical idealization of the actual 
situation. 

7 The forces between projectile and target are then also called repulsive and attractive, 
respectively. 
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Now define the (/th-partial-wave) Green’s function by 


LED | EDE 


Grae) = 5 [ ote) ae SE Pe 
Se ar a9) 
=5\ ew gopte 49 
=F) ew (4.7) 


where we have used the fact that the integrand is even. The integral equation 
(4.3) then becomes 


Gp"): = Gp), + 2m | 2 dr! GPs VP) IP* Ys 


= ./2/nj(pr) + 2m i r’? dr’ GPs r')V(r'Xr'|p*>;. (4.8) 


Note that if the potential has a finite range, then the integral is over a finite 
interval. 
The Green’s function is evaluated using Equation (3.5) to write 


‘A eine jdarhiar') =|. Jlkr)hi(kr’) 
a 75 AIEEE kk? eyo 
oe) al aad eee 4 —k*+ie Oe 
Suppose r’ > r. Then because of the asymptotic behavior (3.6e) of h,, and 
(3.6d), the product j,(qr)h(qr') decreases exponentially in the upper half 
complex g-plane, and one may extend the integration path of the first integral 
to include a large semicircle in the upper half plane (cf. Figure 4.1). From 


Equations (3.3), (3.4), and (3.5) it is seen that 
jkr) = (—1j(—kr) and hit(kr’) = (—1)'h(-kr’) 


Im q 
T(R) 
——+ — Re q 
ZR +R 


Figure 4.1 The integration path I'(R) used to evaluate the first integral 
in Equation (4.9). As R > 00, the contribution from the semicircle 
vanishes, leaving only the integral along the Re q axis. 
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and hence with the substitution gq = —k, the second integral of (3.9) is the 
same as the first integral. We may now rewrite (3.9) as 
see” qikarh(qr’) ( € 
Grr) = tim 5 f d = pa \) 
en) = Om an Ir! @ + aa = Pi) p+e 


where I'(R) is the closed contour indicated by Figure 4.1. The spherical 
Bessel functions are entire functions, and the function q7j,qr)h(qr’)/(p + 4) 
is analytic inside I(R), so by the Cauchy residue theorem® we obtain 

2 


G(s 1’) = — 21 jnhior) 


= —ipj(pr)h(pr’) (> 7); (4.10a) 


A similar procedure, but with r andr’ interchanged, gives the Green’s function 
forr’ <r: 


GP(r; r’) = —ipj(pr’)h(pr) (F247). (4.10b) 
Equations (4.10) are often combined as 
GP(r; 1’) = ipj(pr <)h(pr>) (4.11) 
where 
r< = min(r, 7’) (4.12a) 
and 
re = max(r, 7 ): (4.12b) 


We already know the asymptotic behavior of the free radial wave func- 


tion <r|p>, = ./2/nj(pr) for r > oo [cf. (3.6a)}. Let us now examine the 
asymptotic behavior of the exact radial wave function <r|p~ >,. Using (3.6a), 
(3.6c), and (4.10b) in (4.8), we obtain 


™ 4s Le In 
f rip >i eer sin = 4 
13 - , se , —i)'*} i , 
+ am | \ par | incor én] V(r')<r'|p* >, 


xa el — i(pr—In/2) 
= — .e 
2ipr 


— eilpr—tn/2) f = (2ip)(,/2nm) if 7? ar jloryveexsrin* >| 
(4.13) 


® For a review of important results in the theory of functions of a complex variable, see 
Appendix XVII.A. 
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The first term represents an incoming partial wave, whereas the second 
term represents an outgoing one. (We shall discuss this further at the end of 
this section.) The outgoing wave is modified with respect to the incoming 
one by the factor in square brackets. This factor may be related to the elastic 
scattering phase shift defined by Equation (XVI.2.7) for every value of the 
internal quantum number y. Note that having chosen the quantum numbers 
y such that (2.7) holds, <n,||S,(E)||n4> vanishes for », # n,. Thus we have 
here only elastic scattering for every value of n. 

Recall, now, that the /th partial-wave amplitude defined by (XVI.1.35), 
(XVI.1.18), and (XVI.1.23) is 


T(p) = —mm<Ell,|T|Ell;> = —nm<Ell,|V| EF, (4.14) 


where E = E(p) = p/2m + E'™, the label 7 having been suppressed. In 
terms of the radial wave function this is written 


T(p) = —mm [r sin 8 dr dO d< Ell; |r) <r6| V | Ell; > 


—7mm [sin 6 dO dp Yi:,(O, 6)¥,,(8, 6)| 
x [r dr IEVONKrlE*>| 


= —7m { r° dr Eitenvenxeir®> (4.15) 
0 


where Equations (2.3a), (2.3c), (2.8), (2.9), (VII.3.16), and (3.12) have been 
used. Using (XVI.1.47) and XVI.2.7), it follows from (4.15) that 


Sipe) — 1+ Qiph(p) 


= 1 — (2ip)(,/2nm) he drj(oVNrIp*),. (4.16) 


Although Equations (4.15) and (4.16) are important in themselves, ex- 
pressing the /th partial-wave amplitude and the phase shift in terms of the 
potential and the exact radial wave function, they are of little practical use. 
This is because the differential equation (2.10) and the integral equation 
(4.8) are not easily solved for <r|p*>,, nor is <r|p* >, as directly related to 
the experimental data as are 6,(p) and 7;(p). 
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. f <71P%0 ~sin pr 
(RP. 


s 
CNS 


pr 
<rip*>o ~ sin(pr + 59) for r>a 


Figure 4.2. The / = 0 free and exact radial wave functions for an 
attractive square well. . 


We make use of (4.15) and (4.16) to rewrite (4.13) in various useful forms: 


1 In i(pr—In/2) 
/m/2crip >i es * sin(m = =] a2 De (4.17a) 


r7>o 


e 1 {gilpr—Im/2) 9215p) _ @— lpr Ie/2)y (4.17b) 


reer si ii 
eidlp) Ix 
~ i —— 4.17 
ae sin (o 5 + 240) ( c) 
—ilnz/2 ; , 
~ {e'"'S(p) — (— Dyer |: (4.17d) 
eae Cpr 


Comparison of (4.17c) with the asymptotic form of the free wave function 
obtained from (4.12) and (3.6a), 


TT ; eae. Ix 
f <r| p> = ji(pr) aie pr sin (m -) (4.18) 


explains why 6,(p) is called the “ phase shift”: The interaction shifts the phase 
of the asymptotic exact radial wave function <r|p* >, by 6,(p) as compared 
with the asymptotic free radial wave function <r|p>,. This property also 
serves as an alternative definition of 6, in place of (XVI.2.7). The phase shift 
for the | = 0 wave in a square-well potential is shown in Figure 4.2. 

For an attractive potential (like the square well in Figure 4.2) the wave 
function is “ pulled” into the interaction region and the phase shift is positive. 
The exact wave function emerges from the interaction region with its phase 
in advance of the free wave function. For a repulsive potential the wave 
function is “pushed” out of the interaction region and the phase shift is 
negative. 

That this connection between the sign of the phase shift and the sign of the 
potential is generally fulfilled is best established in the small-phase-shift 
approximation: 


e") sin 5,(p) © 4,(p). 
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From (4.15), using (XVI.2.8), one then obtains 


ete amp iL 1? dr /2aidprV)XrIP*Y 


which in the Born approximation is given by 


Hip) = —2mp |r? dr {jdor)}?V09, 
0) 


To fully justify the above statement that (4.13) and (4.17) represent an 
incoming and an outgoing partial wave, we have to consider the time- 
dependent wave function of a state ¢*(t) that develops in time from a 
prepared in-state. 

The exact radial wave function <r|p* >, is the pth component of the radial 
wave function of an angular-momentum eigenstate ¢,/(t) that develops in 
time according to the equation 


bi, = 5 [dk BE)LENS De®, (4.19) 


where $(E) = $(p), E = p?/2m, is the energy (momentum) distribution of 
this state in the remote past.’ The wave function of this state is 


<r0$| O:,()> = Yu,(8¢)<r16* >: — (4.20) 


where the radial wave function <r|¢@*(t)>, of the state @*(t) is given by 
: HO cee es ae 
Crd": = 5 | dE BEM GIE* (4.21) 
0 


Multiplying (4.17d) by ./2/x(1/2m)¢(E)e~‘*' and integrating over E then 
gives 


e” ir/2)0—-1) 0 o 1 : i ; 3 
ae 26 Nae = a=, om ipr 
G16" ~ a — [ dp H(p)e-™ ~ (eI)! — Soyo) 
e7 in/2)t—- 1) [ e” prt ED Dy 
al rn 
co eilpr— Et) 
- | ar dos) — I (4.22) 
0 


Thus the radial wave function of the angular-momentum eigenstate 
i,(t) consists asymptotically of an incoming wave with the original momen- 
tum distribution and an outgoing wave in which the original momentum 
distribution has been modified by S,(p) due to the interaction. Equations 
(4.13) and (4.17) merely state this result in the time-independent form for 


° That this @*(t), connected with the basis vectors | E/l; >, is a state that develops from a 
prepared in-state has been shown in Section XV.2 and is here of no further consequence. 
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the pth component of the radial wave function, ie., for an (unphysical) 
exact momentum eigenstate. 

The asymptotic forms of the radial wave function (4.22) and (4.17) have 
been derived from the Lippman—Schwinger equation under the assumption 
that the interaction is described by an interaction Hamiltonian V. Then 
S,(p) is given in terms of V(r) by (4.16). Even if the time-development axiom 
is not assumed, there exists a unitary operator S that transforms states before 
the interaction into states after the interaction. If this interaction is of finite 
range, then outside the interaction region the wave function should still 
be a superposition of incoming and outgoing waves. Thus the radial wave 
function for large distances r should still be given by (4.22), where S,(p), the 
Ith S-matrix element, is now the fundamental quantity. Consequently, the 
pth component of the radial wave function, <r|p* >,, should have the asymp- 
totic form (4.17). 

Thus in the framework of a Hamiltonian time development, the asympto- 
tic forms (4.17) and (4.22) are derived, while in the framework of S-matrix 
theory the asymptotic forms (4.17) and (4.22) are assumed and are justified 
as a consequence of the superposition principle for the wave function out- 
side the interaction region. 


XVII.5 Poles and Bound States!° 


In scattering processes, the variable p that we have been considering in the 
preceding section is the magnitude of the incident momentum, and therefore 
a real positive quantity. Accordingly, the variable E = p*/2m representing 
the kinetic energy (or E — Ei" = p*/2m if the internal energy has to be taken 
into account) is a real positive quantity. We know, however, that other 
values of E also have physical significance. For example, the solutions of 
(2.10) for negative values of p? = 2mE are the bound-state radial wave func- 
tions, and the discrete negative energy values E,,, for which (2.10) has normal- 
izable solutions <r|E,,>,, are the energy levels of the projectile-target bound 
system (as has been discussed in Section VII.3). 

One should then also be able to extend the integral equation (4.8) [with 

P given by (4.11)], which follows from (2.10), to negative-energy solutions. 
Such solutions can be obtained by the following line of reasoning: Replace 
p by the complex variable z. For Im z = 0, the arguments that lead to the 
Green’s function (4.11) are still valid. (For Im z < 0, similar arguments, 
wherein one closes the contour in the lower half of the complex z-plane, 
hold, and one gets analogous expressions with h, replaced by h*.) 


10 For this section, as well as for some sections of Chapter XVIII, we shall require some 
basic results from the theory of functions of a complex variable. These results are outlined in a 
mathematical appendix to this chapter, Appendix XVIL.A. The reader who is unfamiliar with 
complex variable theory, or who merely wishes to refresh his memory, may consult the appendix 
before reading this section. 
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The fact that (4.11) holds in this general case does not necessarily mean 
that (4.8) can also be continued to Im z > 0. Let us assume, however, that 
the potential V(r) is such that it can be so continued, and let us assume that 
H has a bound state with energy E = —a*/2m. Then, at the point z = ia, 
the radial wave function <r|ix*, (being, as a wave function of a bound 
state, an element of the space of infinitely differentiable, rapidly decreasing 
functions that is a realization of the Schwartz space) must asymptotically 
decrease faster than any power of r~', e.g., it must decrease exponentially. 
Now according to (3.6d), j,(izr) diverges exponentially. The second term in 
the RHS of (4.8) is bounded as r > 00, because h,(iar) decreases exponen- 
tially according to (3.6e). Thus <r|ix*>, can decrease for r > 00 only if the 
terme<r|ip>, = ./2/nj(pr), originating according to (4.13) or (4.17a) from 
the incident state, drops out of Equation (4.8). This can be achieved by con- 
sidering, instead of (4.8), the equation obtained by multiplying Equation 
(4.8) by (z — ia): 


(z — ia)<r|z* >, = (2 — ia)<r|z), 
+ 2ma (a dr’ j,(iar -)h(iar, )V(r')(z — ia)<r’'|z*>,. 
0 
(5.1) 


Equation (5.1) is identically fulfilled at z = ix unless <r|z* >, has a pole 
at z = ia, i.e., in the neighborhood of ia is of the form!! 


1 
— <r|E = —a2/2m), 
10% 


<r|z*>, = 


id 
4 


+ (function of z analytic around z = ia). (5.2) 


Then (5.1) goes over into 
<r|E = —a?/2m), = 2ma { r? dr’ j(iar )h(ior, V(r')<r' |E = —a?/2m),, 
0 


(5.3) 


which is the homogeneous integral equation for the discrete number of 
normalizable solutions of the Schrodinger equation (2.10) with eigenvalues 
E = —«?/2m. Thus we see that a bound-state solution is obtained if we 
assume that <r|z* > is of the form (5.2), i-e., that it has a pole on the positive 
imaginary momentum axis at the point z = ia = i,/2m|E|, where E is the 
bound-state energy eigenvalue for angular momentum /. The bound-state 
radial wave function <r|E = —«?/2m), is then the residue at this pole. 
From (4.15) and (4.16) one then concludes that the /th partial-wave am- 
plitude and the /th S-matrix element have poles on the positive imaginary 


'! The above arguments will also hold for a pole of higher order, but one can prove that these 
poles are simple (which we shall not, however, do here). 
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momentum axis at the values p = i,/2m|E‘|, where E,, (n = 1, 2,...) are the 
discrete eigenvalues of the Hamiltonian H. In a theory with Hamiltonian 
time development, these discrete eigenvalues correspond to the energy levels 
of the projectile-target bound states of angular momentum le 

Though the above arguments’? show that a bound state of the Hamiltonian 
with energy E! corresponds to a pole of the /th partial S-matrix element 
Sp) on the positive imaginary axis, the reverse need not hold: T,(p) and 
S,(p) may have singularities that are not connected with the bound states.** 

Nevertheless, the hypothesis that bound states of angular momentum /| 
correspond to poles of S,(p) on the positive imaginary axis (and vice versa) 
has become quite generally accepted. Especially in those cases where a 
Hamiltonian time development is not assumed to exist (relativistic S-matrix 
theory) and there is no operator whose spectral properties characterize the 
bound states, one usually assumes a one-to-one correspondence between 
bound states and poles on the positive imaginary axis. 


XVII.6 Survey of Some General Properties of Scattering 
Amplitudes and Phase Shifts 


The precise properties of the scattering amplitudes and the phase shifts 
depend upon the interaction. If the interaction is described by a potential, 
one can calculate phase shifts and partial-wave scattering amplitudes as a 
function of scattering energy, as has been done in some of the problems. 
There are, however, general properties of these functions that do not depend 
upon the particular form of the interaction, and we want to list them here. 
The Born approximation for the partial-wave amplitude, obtained from 


(4.15) by inserting there <r|p*> = <r|p> = \/2/nj,(pr) namely 
se 2 a 
TiP"™(p) = —xm = i r? dr j(pr)V (r)j(pr), (6.1) 
0 


is believed to be good for high energies and weak potentials, because in the 
higher-order terms the potential and the spherical Bessel functions, which 
fulfill (3.6a), appear in higher powers. 

The asymptotic behavior of the /th partial S-matrix element is then 


S(p) 71 asp o. (6.2) 


This follows immediately from (4.16) using the Born approximation (6.1) 
and the asymptotic behavior (3.6a) of j,(pr). An intuitive argument for (6. 2) 


12 For potential scattering, the connection between simple poles and bound-state eigen- 
values of H can be made more precise, Cf. Taylor (1972, Chapter 12). 

'3 These “redundant” poles may occur if the asymptotic expressions in the complex p-plane 
differ from (4.17). For interactions that are cut off at a finite distance and also for potentials 
that fall off at infinity faster than any exponential (e.g. e “”), “redundant” poles are absent. 
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is that with increasing energy of the projectile the effect of a given interaction 
will become less important. 

The phase shift 6,(p), therefore, tends to a multiple of z as p > oo. As the 
phase shift is defined by (XVI.2.7), only within a multiple of z one can remove 
this modulo z ambiguity by defining 


6(p) +0 asp — oo. (6.3) 


Requiring then that 6,(p) be a continuous function of p (which is possible 
because $,(p) is continuous) make 6,(p) unique. 

Clearly S,(p) > 1 also if the interaction goes to zero. Then with the con- 
vention (6.3) it follows that also 


6(p) +0 for interaction going to zero [V(r) > 0]. (6.4) 
For a given potential and energy 
T, 70 and §,>-1 aslo om. (6.5) 


Intuitively, this can be understood from (2.10) by regarding the term 
(1 + 1)/2mr? as a repulsive centrifugal potential. The larger / is, the more 
repulsive is this centrifugal barrier and the less effective does the actual po- 
tential V(r) become. Let R be the range of the interaction. Then for values / 
such that /(/ + 1)/2mR? is much larger than the kinetic energy E = p?/2m 
the projectile is unlikely to penetrate into the range of the interaction. Thus 
for 


1 + 1) > p?R?, orl > pR, 
the partial-wave amplitude 7;(p) will be negligible: 
Tp) = 0, Sp) =1 for] > pR. (6.6) 
It follows from (6.6) that for the phase shift 
6, % nn, n integer for 1 > pR. (6.7) 


Equations (6.6) and (6.7) are important properties of which one makes use 
when one expresses the differential cross section in terms of the partial-wave 
amplitudes as in (XVI.1.36). For a given value of E = p*/2m the infinite 
sum in (XVI.1.36) can, because of (6.6), be approximated by a finite sum. 

To obtain the phase-shift behavior at low energies, we turn to (4.15). For 
small values of p, <r|p* >, and j(pr) have the same p-dependence, given by 
(3.7a), since according to (4.11), 


Heat Te | 
, 7 pre prs (21 Wes atl et al 
Dp = SF —_ — 4 JL. j| ee Tee 
CV) IP oa aalai” nies PP 
r! r_'-} 
GPr, r') > — = for p> 0 (6.8) 


Zl 
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and <r|p*>, and j,(pr) are connected by (4.8). Consequently, it follows from 
(4.15), for potentials that vanish for r greater than some R, that 


1 Te 
T(p) > mn( | r a Pervert aan Ort re Di ep) or) 


T(p) > —a,p"' for p>0 (6.9) 


where a, is a constant given’ by the integral in brackets, which does not 
depend upon p, because <r|p* >,/j,(pr) > (function of r) for p > 0. ee 

The constants a, are called the scattering lengths. Only the s-wave scatter- 
ing length a, has the dimension of a length. (6.9) shows that for p = 0 all 
partial-wave amplitudes except the s-wave vanish. For the s-wave 


T)(p) > —dg asp), (6.10) 
and consequently by (XVI.1.37) and (XVI.1.39), 
o> a, o—>4naz, asp—0. (6.11) 


Using (XVI.2.8) it follows further that 


6(p) — nt —> —ap”'*' asp 0, (6.12) 


and consequently also 
tand6,(p) > —a,p?'t! asp—0. (6.13) 
Equation (6.13) shows that 


1 
Do C0000) >a 
a; 
or their inverses are the appropriate functions to consider near p = 0 (as 
long as the scattering length a, 4 0 or a, # 0). The power series for this 
quantity near p = Ois: 
| Re oe 
p*'** cot 6(p) = —— + 5 p® + O(p*). (6.14) 
I 
That only even powers appear in the power-series expansion of p*'** cot 6,(p) 
follows from 6,(— p) = —0,(p), which is established in Equation (XVIII.5.7) 
below. 
For | = 0, (6.14) is a very useful approximation called the effective-range 
approximation: 


1 
p cot 6{p) = — — aaa Z =e. (6.15) 


'4 Tf the potential does not have a finite range, some modifications are necessary, but one 
can show that for V(r) decreasing faster than any power of 1/r the result (6.9) still holds. 
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The constant ro is called the effective range of the potential; one can show 
that ro is roughly proportional to the range of the potential. Equation (6.15) 
is a good approximation for energies small compared to the potential energy 
and has been one of the most important parameterizations of low-energy 
scattering data, in particular for neutron-proton scattering. 

Without giving a derivation, we mention that the sign of ag is related to 
whether or not an S-wave bound state is present. If the scattering length 
ay < 0, no bound state is possible, and for ay > 0, a bound state of target 
and projectile is formed. 


XVII.A Mathematical Appendix 


In this appendix we review some basic results in the theory of analytic func- 
tions of a complex variable. For proofs of these results, and for more on the 
theory of analytic functions, the reader is referred to, e.g., Smirnov (1964, 
Vol Il, Part 2). 


a. Definition of Analyticity 


The function f(z) = u + iv of the complex variable z = x + iy is said to be 
analytic in some region R of the z-plane when 


1. df/dz is continuous and independent of the direction of dz in the 
region R. (The following conditions are equivalent.) 

2. du/dx = dv/dy du/dy = —dv/dx (Cauchy—Riemann conditions). These 
derivatives are also continuous in R. 

3. 4$¢f(z) dz = 0 for any closed contour C lying within R. 


If f(z) is analytic in R, then so is d"f/dz” for alln = 0,1, 2,.... 


b. Cauchy Integral Formula 
If f(z) is analytic within some simply connected region R, then 
p flan = 2nif (a), (A.1a) 
z—a 


2ni af (a) 


n! da" 


(A.1b) 


pea 


f(z)dz dui 
Te n! 


= ie x 


for contours entirely within R and enclosing the point a. If a is excluded, the 
integrals vanish. (Integration is assumed to be counterclockwise.) Thus the 
values of f(z) within R are completely determined by the values on the 
boundary. 
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c. Taylor Series 


If f(z) is analytic within and on a circle C centered at z = Zo, then one can 
expand f(z) about Zp: 


Vie Sate = ay (A.2a) 
n=0 
where 
1 
canes (Za): (A.2b) 


This Taylor series is uniformly convergent as long as the radius of C is less 
than the “radius of convergence,” which is the distance from Z, to the nearest 
singularity of f(z), i.e., the nearest point where f is nonanalytic. Conversely, 
any power series represents an analytic function within its radius of con- 
vergence. 


d. Laurent Series 


If f(z) is analytic within and on the annular region between two concentric 
circles C, and C, centered at z = Zo, then for any point z within the annular 
region one has the uniformly convergent power series 


[o.0) 


f(z) = o a,(z — Zo)", (A.3a) 
where 
f(z) dz 
Gn no =¢ (z = =z)" (A.3b) 


C being any contour within the annulus. 


e. Singularities 


Points of nonanalyticity of f are called singularities of f. 


1. A single-valued function f can have isolated singularities of the 
following kinds: 


(a) A singularity at z = ais called a pole of order nifasz—a 
g(z) _ 
(z — a)" 


where g(z) is analytic at z = a, and g(a) # 0. A pole of order 1 
is called a simple pole. 


far 


(ni 22) (A.4) 
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(b) If f(z)(z — a)" diverges as z = a for all finite n, then a is called 
an essential singularity of f. 

In the Laurant series for f about a pole of order n, —n is the lowest 
power one encounters. The Laurent series about an essential singular- 
ity contains infinitely many terms of negative power. 

2. A multivalued function f (e.g., f = 2?) has branch points, which 
always occur in pairs. At a branch point z = a, the function f(a + ¢e'?), 
where ¢ is such that a + ee’* does not include another branch point of 
f for all ¢, is not periodic in @ with period 27. 


A function that is analytic in a region of the complex plane, except for a 
finite number of poles in that region, is said to be meromorphic in that region. 
If the function is meromorphic in the entire z-plane, except at the point co, 
it is called a meromorphic function of z. If the function is analytic in the entire 
z-plane, except at 00, it is called an entire function. 


f. Cauchy Residue Theorem 


The coefficient a_, in the Laurent series is called the residue of f(z) at 
Z = Zo. If f(z) is analytic within and on a closed contour C, except for a 
finite number of poles and essential singularities, then 


f (2) dz = 2ni ¥ (residues in C) (A.5) 
c 


g. Analytic Continuation 


Let two functions be analytic in a region R, and have the same values (a) 
in some subregion of R, or (b) on a line segment in R, or (c) at a denumerably 
infinite number of points having a limit point within R. Then they are 
identical throughout R. However, if one knows only that they differ at most 
by an amount € in (a), (b), or (c), then, no matter how small |¢| is, the two 
functions may differ vastly within the rest of R. 

The facts stated above illustrate the justification and the limitations of 
analytic continuation: If f,(z) is defined on a line segment I, and /,(z) is 
analytic in a region R containing I, and if f;(z) = f,(z) on T, then f,(z) is a 
unique analytic continuation of f,(z) onto R. 

In the same spirit one can also do the following: Let /,(z) be given within 
a circle R,,e.g., by a power series about the center z,. Let f,(z) be the power- 
series expansion of f,(z) about a point z, near the periphery of R,, the circle 
of convergence being R,, which extends beyond R,. Then the values of 
f(z) in R, are uniquely determined by the values of f,(z) in R,, and f,(z) is 
the analytic continuation of f,(z) into R,. One can repeat this process 
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ae . 


Figure A.1 Extension of a function defined in R, into the region 
R,UR;U R4U--: by analytic continuation. 


several times, obtaining a unique analytic continuation F(z) of f,(z) into 
the region R, UR, UR; vu... (see Figure A.1). A function cannot be 
analytically continued into a region containing a singularity of that function. 

After repeating the process of analytic continuation several times, the 
nth circle of convergence may overlap the first one. Then the values of /,(z) 
in the overlap region may or may not coincide with those of f,(z). In the 
latter case the function is multivalued, and the region of encirclement (see 
Figure A.2) contains a simple branch point of the function. 

We have seen above how the values of an analytic function in a large 
region of the complex plane are determined by its values in an arbitrary 
small region of analyticity. However, the analytic continuation does not 
depend on these initial values in a continuous manner. Therefore, it is im- 
possible to express F(z) in R explicitly in terms of the values in a certain 
small region. Though F(z) is uniquely prescribed by its values in the sub- 
region, the problem of constructing F(z) in R from its values in the small 
subregion is not a correctly formulated one. 


R 
branch : 
point 


overlap 


Figure A.2. Analytic continuation of a multivalued function around a 
branch point. 


Mathematical Appendix 395 
h. Schwarz Reflection Principle 


If f(z) is analytic in a region R that includes a segment of the real axis and 
if f(z) is real on this segment, then f(z) can be continued onto the region 
R* = {z*|ze R} and satisfies f(z) = [ f(z*)]* for all ze R U R*. 


i. Multivalued Functions 


The function w(z) can be considered as a map of the z-plane onto the 
w-plane. Consider the example w(z) = z’, Figure A.3. It is clear from the 
figure that the upper half of the z-plane, including the positive real axis 
and excluding the negative real axis, is mapped onto the entire w-plane. 
So also is the lower half of the z-plane, including the negative real axis 
and excluding the positive real axis. Thus one complete circuit of the 
z-plane (e.g., efgpqr) involves two complete circuits of the w-plane (e.g., 
e'f'g p'q'r’) provided the former circuit encloses the origin. 

The inverse of a function such as w(z) is a multivalued function. To con- 
tinue with the above example, consider the inverse “function” 


f(2) = 21, 
Writing z and f in polar form, 
B= oe, f =e’, 
we have 
a, O= @/2. 


In making one complete circuit around the origin of the z-plane,0 < ¢@ < 2z, 
one has 


Ae Ale 
te me ae 


and the part of the f-plane so covered is the upper half including the +Re f 
axis and excluding the —Re f axis. Going around the z-plane once more 


(a) (b) 


Figure A.3. The function w(z) = z? (a) z-plane; (b) w-plane. 
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z-plane f-plane 


(a) : (b) 


Figure A.4 The function f(z) = z'/? (a) z-plane; (b) f-plane. 


(2n < @ < 4n), the point with polar angle ¢ now has polar angle @ + 27; 


consequently 
if = pice enie = —p'/2giei2. 


Making a third circuit would give back the value f, for. Clearly fis a double- 
valued function of z (Figure A.4). 

We have seen that it requires two circuits around the origin of the z-plane 
to encircle the origin of the f-plane, thus giving rise to the double-valued- 
ness of f. On the other hand, starting with some point z (#0) in the z-plane 
with well-defined argument'> @ and following a closed contour C, one traces 
out a closed contour C’ in the f-plane (Figure A.5). Thus, as long as one 
does not encircle the origin one can treat fas a single-valued function. 

The two values of z!/? that occur for each value of z form two independent 
sets, called branches of z'!? (e.g., upper and lower 2” ?_plane). On crossing 
the positive real z-axis one goes over from one branch of z z}/? into the other. 
The positive real axis is called a branch line. If one analytically continues 
one branch of the function z!/? along a circle that enclosed the origin, one 
ends up with the othet branch. The region of encirclement then contains a 
singularity of z*/?. Since the circle can be made arbitrarily small, it is clear 
that the singularity is z = 0. This is called a branch point of the function 
ge 

At a branch point, the multivalued function f(z) has the same value for 
all branches of z. Branch points always occur in pairs, with a branch line 
joining a pair of branch points. In our example z = 00 1s the other branch 
point. Any curve joining these two branch points can serve as a branch 
line —the choice is a matter of definition and usually dictated by convenience. 


j. Riemann Surfaces 


The theory of analytic functions outlined above for single-valued functions 
can be extended to a wide class of multivalued functions using a geometrical 
construct called a Riemann surface. 


15 More precisely, defined up to integral multiples of 4z. 
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Ay 


(a) (b) 


Figure A.S Closed contour in one branch of the function f(z) = z!/2 
(a) z-plane; (b) f-plane. 


We shall describe the Riemann surface for the function f(z) = z!/?. We 
take two z-planes, call them R, and R,, and define the branch line to be the 
positive real axis. We “cut” R, along the branch line (a branch cut), or 
actually “a little below” it; we do the same for R,. 

We now join the lower lip of R, to the upper lip of R,, and the lower lip 
of R, to the upper lip of R,. The surface thus obtained is the Riemann surface 
for the function z’/? (Figure A.6). 

We now define $ on R, to have the range 0 < ¢ < 27, and ¢ on R;j to 
have the range 2x < @ < 4x. Thus on the entire Riemann surface ¢ takes on 
the values 0 < ¢ < 4n. Starting from ¢ = 0 in R, and encircling the branch 
point, one reaches the branch cut. Crossing the branch cut takes us onto the 
upper region, R,. Crossing the branch cut once more after encircling the 
origin, one returns to R,. For z in R, [R,] one has f(z) = f,(z) [2(2)), 
i.e., each Riemann sheet of the composite Riemann surface corresponds to a 
single-valued branch of the composite multivalued function. 

What we have achieved is the following: From a sequence of single- 
valued functions [branches of the multivalued f(z)] defined on one complex 
z-plane, we have obtained one continuous single-valued function defined 
on the Riemann surface. f(z) is now analytic over the entire Riemann surface 
except for the branch points, which are now to be treated as (isolated) 
singularities. 

A branch point is said to be of order n if the (multivalued) function is 
returned to its original value after encircling the branch point at least n + 1 
times. If this cannot be done for finite n, the branch point is said to be of 


{a> 


ag 


Figure A.6 Riemann surface for f(z) = z’/”. 
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infinite order. Thus for the function z'”, the branch points z = 0 (and 
z = o0)!® are of order n — 1. The Riemann surface has n sheets. 
An example of an infinite-valued function is the function 


f() =Inz 
=Inp + i(p + 2nn); (A.6) 


each encirclement of the branch point z = 0 increases the value of In z by 
2ni. Accordingly, z = 0 is a branch point of infinite order, and the Riemann 
surface has an infinity of sheets. - 


Problems 


1. The square-well potential is given by 


-—VY ifr < a. 
0 ify = a: 


a= 


(a) Calculate the Born approximation of the scattering amplitude. 
(b) Show that the exact expression for the [th partial-wave amplitude is given by 


1 pjdka)j(pa) — kii(ka)j(pa) 
p kjkayh(pa) — pi{ka)h\(pa) 


where k = ./p? + 2mV, (momentum inside the well) and j,(z) = dj,/dz. 
(c) Show that at low energies the s-wave, | = 0, dominates so that T(p, 8) > To(p). 
(d) Compare the correct low-energy amplitude with the Born approximation and 
show that at low energies the Born approximation is good only for a shallow 
well. : 
2. Calculate the partial cross section o;- for scattering by a square-well potential 
in the limit of zero scattering energy. 


T(p) = 


'© f(z) is said to have a singularity at z = oo if f(1/z) has a singularity at z = 0. 


CHAPTER XVIII 


Resonance Phenomena 


Resonance phenomena constitute some of the most interesting and striking 
features of scattering experiments. This chapter discusses in detail the con- 
nection between quasistationary states and resonance phenomena, and cul- 
minates in the derivation of the Breit- Wigner formula. In Section X VIII.2 the 
concept of “time delay” is introduced and its relation to the phase shift 
derived. Various formulations of causality are given in Section XVIIL.3. In 
Section XVIII.4 the causality condition is used to derive certain analyticity 
properties of the S-matrix. These properties are discussed further in Section 
XVIII.S. In Section XVIII.6, the central section of this chapter, the con- 
nection between quasistationary states, defined by a large time delay, and 
resonances, defined by characteristic structures in the cross section, 1s 
derived. Section XVIII.7 describes the observable effects of virtual states. 
Section XVIII.8 discusses the effect resonances have on the Argand diagram. 
The actual appearance of resonances in experimental data when the effects 
of the resonant phase shift, the nonresonant background, and the limited 
resolution of the apparatus are taken into account is discussed in Section 


XVHL9. 


XVIII.1 Introduction 


In our study of the discrete energy spectra of atoms and molecules we treated 
the excited states as if they were infinitely long-lived. When we discussed in 
Section XI.4 the helium atom and its levels, which have the same energy 
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value as the physical system consisting of an He* ion and an electron, we 
ignored the presence of the He*-e~ system and used an approximate descrip- 
tion in which the helium atom was considered to be an isolated stationary 
system, even though the helium-atom system can never really be isolated 
from the He*-e~ system with which it interacts. In this approximate descrip- 
tion of excited states the transition processes could not be described. In the 
more accurate description, which does not ignore transitions, excited states 
have a finite lifetime. This lifetime is fairly long for the ordinary discrete 
energy levels, but is not so long for the discrete energy levels that lie in the 
continuous spectrum of the He-(He*-e~) system; the lifetimes of both sorts 
of states are still, however, much longer than the time taken by the transition 
processes. 

The helium energy-loss experiment, whose result is depicted in Figure 
X1.4.1, is an inelastic scattering process in which a long-lived intermediate 
state He* is produced, which subsequently decays either into a helium ion 
and electron or into a helium atom and photon: 


e + He-e™' + He* 


| L_.Het + e7 (1.1) 
He + y 


The lifetime of a singly excited He*(S), which has an energy level below the 
first ionization threshold, is orders of magnitude larger than the lifetime of 
the doubly excited He*(D).' This is due to the fact that the interaction Vj that 
causes the transition 


He*(D) > He* + e7 
is much stronger than the interaction V, that causes the transitions 
He*(D) — He + y 
and . 
He*(S) > He + y. 


[The transition He*(S) > He* + e is energetically impossible.] In the 
approximation of Section XI.3, in which both ), and Vy are ignored, the 
energy operator is given by 


H=K+4+V, 


where K = K gectron + Kye With Ky. given by (XI1.1.2), and V describes the 
interaction between He and e . Consequently both He*(S) and He*(D) 
have infinite lifetimes. If Y, is ignored but the stronger interaction Vj is not, 
then He*(S) lives infinitely long, but He*(D) has a finite lifetime, which is 
determined by V,. If neither Y nor Vj, is ignored, then both He*(S) and He*(D) 
have finite lifetimes. 


' The width of the bumps in Figure X1.3.1 is mainly due to the resolution of the apparatus 
caused, for example, by the energy spread in the incident electron beam and not a measure of 
the lifetime. 
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Figure 1.1 Schematic diagram of a production process. 


Systems that have finite lifetimes are called quasistationary or metastable 
states, in distinction to the stationary or stable states, which are “infinitely” 
long lived. Below in Section XVIII.6 we will justify the name resonances 
for the quasistationary states. When to consider a state as a resonance and 
when to consider it as stable is a matter of the accuracy required of the 
description; the distinction between (relatively) stable states [particles like 
He*(S)] and unstable states [resonances like He*(D)] is in principle quantita- 
tive rather than qualitative. 

There are two main types of experiments in which resonances can be 
obtained: formation experiments and production experiments. The process 
(1.1) is an example of a production experiment, which experiments are 
characterized by processes of the form 


a+T>ob+R>b4+c44, (12) 


and which are depicted by diagrams like that of Figure 1.1. Production 
experiments always involve inelastic scattering processes. Formation 
experiments, on the other hand, consist of simpler processes 


a+T>R-a+T' Clea) 


and are illustrated by diagrams like that of Figure 1.2. 
A beautiful example of a resonance obtained in a formation experiment in 
atomic physics is the Schulz resonance in the elastic scattering process 


e +He—~He —He+e. (1.4) 


If one measures the elastic scattering cross section as a function of the scatter- 
ing energy (kinetic energy of the projectile e~ with respect to the target He), 
one observes that at the energy E = Eg = 19.31 eV something unusual 
happens. At this value the cross section changes violently, as shown in the 
data of various elastic-collision experiments [cf. Figures 1.3, 1.4, and 1.5, 


Figure 1.2 Schematic diagram of a formation process. 


402 Resonance Phenomena 


1! 


2| 3| 4| 5 | 3p 

2| ; 3| 4| 5 | 15 

2| 3| 4| | || °s 
Helium 
energy 


levels 


Transmitted current (arbitrary units-zero displaced) 


19 20 21 722 23 24 25 
Electron energy (eV) 


Figure 1.3 Typical current-energy plots for electrons in helium as 
observed by C. E. Kuyatt, J. A. Simpson, and S. R. Mielczarek. 
[From Physical Review 138, A385 (1965), with permission. ] 


which show the observed cross sections (Figures 1.4a, 1.4b, 1.5) as well as 
the intensity of the transmitted electron current, i.e., the intensity of the 
current of electrons that are not scattered (Figures 1.3, 1.4b, 1.4d)]. We shall 
show that the effect shown in these figures can be explained from the hypo- 
thesis that He and e~ form a long-lived compound that belongs to one of 
the energy levels of the He~ ion, as indicated by Equation (1.4). 

A resonance is usually defined as a sharp structure in the cross section 
together with a rapidly increasing phase shift going through 7/2. In the 
remainder of this chapter we shall see that in general the formation of a 
quasistationary state is connected with these effects, and shall study the 
characteristic properties of the S-matrix and phase shifts for these phenomena. 
We shall in this chapter restrict ourselves to the case where only one channel 
is open. This means that we are in an energy range below the first inelastic 
threshold, where only the elastic channel 


a+T>R-a+T ‘6 ES)) 
is open, so that the only reduced S-matrix element different from zero is 


S({E,) = <nallSKE Ding = et 


(Recall that 4, are the quantum numbers that determine the internal state 
of the a-T system; we omit the label 7, on S, and 6,.) 

Resonances in a multichannel system will be discussed in Chapter XX; 
it will be shown there that with a few changes multichannel resonances may 
be treated in much the same manner as the single-channel resonance case 
studied in this chapter. 
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Figure 1.4 (a) Variation of intensity of scattering of electrons at 72° by helium atoms 
with electron energy, as observed by Schulz, Phys. Rev. Lett. 13, 583 (1964). (b) Variation 
with electron energy of the transmission of electrons through helium as observed by 
Simpson, Phys. Rev. Lett. 11, 158 (1964). (c) Variation with electron energy of the total 
cross section for elastic scattering of electrons by helium atoms, as observed by Golden 
and Bandel, Phys. Rev. 138, Al4 (1965). @, Experimental points. (d) Variation with 
electron energy of the transmission of electrons through helium as observed by Golden 
and Nakano, Phys. Rev. 144, 71 (1966). 
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Figure 1.5 Variation with incident electron energy of the intensity of 
electrons elastically scattered in helium at different angles of scattering. 
The intensity variations may be judged from the lines shown on the 
right-hand side, which indicate 10 percent of the total intensity near 
the resonance. From D. Andrick and H. Ehrhardt, Z. Phys. 192, 99 
(1966). 


XVIII.2 Time Delay and Phase Shifts 


The lifetime of a quasistationary state formed in a scattering experiment of 
the form (1.3)? is roughly measured by the time by which the projectile is 
delayed in the scattering region as a consequence of the interaction. Let &, 
the interaction region around the target, be characterized by a radius R, 
which is chosen large enough such that there is no interaction outside of &. 
This is possible because the interactions encountered in practice do have a 
finite range. We describe the state of the projectile-target system in the 
Schrodinger picture by the statistical operator W(t). We also imagine a 
fictitious noninteracting system described by the operator W'%(t). If H = 
K + Vis the total-energy operator, then the time development of W(t) is 
generated by H, while that of W'\(r) is generated by K (cf. Equation 
(XIV.5.5)]. Since we shall consider only elastic scattering, with the target 
remaining in the pure state W; = |74><n,4|, we can ignore the factor W; in 
wi? = Wi? x W, and consider W as referring to the projectile alone. 
W(t) then describes the state as it develops when the target is present in the 
center of #, while W'"(t) describes the state when there is no target in &. 


>To keep the discussion simple we assume that projectile mass remains unchanged, i.e., 
mM =m, — Mm. 
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The probability that the projectile is in the region & at the time f is given 
by 
I(t) = { d3x <x|W(t)|x> (2.1) 
R 


(cf. Section II.10 and II.11), where |x) are the generalized eigenvectors of the 
projectile position operator with respect to the target. The total time the 
projectile spends in & is then 


T= i dt I(t) = ie dt [ d?x <x|W(t)|x>. (2.2) 


If there were no interaction between projectile and target, the time spent in 
R would be 


— a ue w* : i, 
[a [atx cxiwne@ixy 23) 


The time that the projectile is delayed by the interaction with the target is 
therefore 


(aT Te = L dt { d?x (<x|W(0)|x> — <x|Wi%(1)|x>). (2.4) 
= R 


This “time delay”** may be positive or negative. If a quasistationary state is 
formed it will be positive and large. 

We now investigate the relationship of the time delay t\” with the phase 
shifts and the S-matrix. We insert complete systems of K and H eigenvectors 
into Equation (2.4) and use Equations (XIV.5.5) and (XIV.5.21) (which was 
proved in Appendix XV.A) to obtain 


R 
t'") 


[ ar | deaicxia dale “Wwe la * ya |x> 
ire, wee 


aa’ 


— <x]a><ale""* Wire" a’) <a'|x)} 


fee) 
i dt ¥ e7 Ea Faq] Witla’) 
102) aa’ 


x | d?x{<x|a*) <a’* |x) — <x|a)<a’[x>}. (2.5) 
R 


& is the region within which the interaction takes place. The wave function 
¢x|a*> in this region depends strongly upon the particular nature of the 
interaction. On the other hand, the wavefunction has a very general form 
outside the region &, where the projectile moves freely, the effect of the inter- 
action being expressed by a phase shift relative to the asymptotic form of the 
free wave function <x|a). Since we want to obtain statements about the time 
delay that are very generally true, we attempt to replace the integral over # 


2" Time delay has been introduced previously in a similar way by F. T. Smith: Phys. Rev. fff, 
349 (1960). Time delay functions computed for particular potentials are given in R. J. LeRoy, 
R. B. Bernstein: J. Chem. Phys. 54, 5114 (1971). 
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by an integral outside of Z. To this end, we first show that the time delay 
over all space (R — 00) vanishes. Denote all space by co, and the region out- 
side & by (co — &). Then 


| d?x {<x|at) <a't |x) — <xla) <a’ |X} 


= { d?x {<a’* |x) <x|a*) — ¢a’|x><x|a)} (2.6) 
= <a't Jat) —Ca' a), 


where we have used the relation 
i HERA ORG 1h. 


According to Equation (XIV.2.9a,b), the generalized eigenvectors of H and 
K have the same normalization. (This is proved in Appendix XV.A.) Thus 
the right-hand side of (2.6) and consequently t$°) vanish. The time delay 
over region & can then be written as the time delay over (co — &): 


oO 
ig = \ dt Y e~ Ea- Fag) Wir|a'y 
S100) 


, 
aa 


xf dx {exlad<a'lx> — Gxla* >a" 1X}. 7) 
(00 — #) 
Assuming spherical symmetry, 1.e., [H, L;] = [.K, L;] = 0, we may choose 


angular-momentum bases {|Ell,y*>} and {|Ell,n>} for {]a*>} and 
{|a>}.> Then the time delay is given by 


+ co 
i?) = | dt’ | p(E) dE p(E') dE’ e~£- FEI, | W'"| E'll,> 


—O ly, 
x | r? dr {<r| E>, <r | EDF — r|E*>, rE PF}, (2.8) 
R 
where p is the normalization function of (XIV.2.9a), and where the relations 
<rOP|Ells> = Yu,(8, $)<rlEDi, (2.9a) 
<rO$| Ells > = Yu,8, 9) <TIE* >, (2.9b) 
[ a Yi (Q)Ni,Q) = dv 91,1, (2.9¢) 


have been used. We can write (2.8) in the form 


(5? = [de Y. | p(R)dE ple) dB’ e-"*—CEN,|W|E'Ns)J, (210) 


U3 


> Here 9 can be any set of additional quantum numbers, and we shall usually suppress the 
label y. 
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where 
oS i, r? dr {Cr| ED <r|E>F — Cr E*)>, <r E'* #3. (2.11) 


In Equation (2.11), which holds for all R larger than the radius of the inter- 
action region &, we choose R to lie in the asymptotic region. The integral J 
can then be evaluated using the asymptotic forms (X VII.4.17) and (XVIL.4.18) 
of the exact and free radial wave functions.** Though derived under the as- 
sumption of a Hamiltonian time development, these asymptotic forms exist 
independently of H, depending as they do only on the S-matrix through the 
phase shift 6,. We have, therefore, 


Pg ie 1 : (518 | ee 
J= = I r2 dr ae isn (mr _ *) sin (° r— 3| 


’ woe l 
— gild(P)- 41?) gin (mr » . 4 540)) an (n° eS ud i 5400). (2.12) 


Because the t-integration in (2.10) gives a factor 6(E — E’), one only needs 
to evaluate J in the limit E’ > E, 1.e., p' > p. The evaluation of this integral 
is an exercise in distribution theory, with the result 


wk [eae 


J(p > p’) MeN as 


[The calculation that gives (2.13) proceeds in the following way. 
We rewrite the integrand in (2.12) as follows: 


(1) - cos(2pR + 6,(p)) sin s4n)h. (2.13) 


+ 00 


npp'J = ; i dr Or — R){eP—P [1 — @2orp)— HP] 


ais (= 1)'[et +P’ (e2iut) _ 1) alls e MPT P(g 287’) ae DBS 
(2.14) 


The integral over the first term in the integrand is the Fourier 
transform of the 6-function and may be calculated using Equation 
(XV 2225): 


1¢t2 ; ae 
first term = — | UO Re Pere ee ete) 


ie 8) 
= 411 a enieinrairre= rt | dr&(r — R)eile— Pr—R) 
mail 8, 


i 


[1 — e2ieur)— dlp) pte PR (2.15) 


ne 
: p — p'+ i0 


34 Only for / = 0 are the exact and free radial wave functions given by the trigonometric 
function such that for / = 0 (2.12) holds for any value of R outside the interaction region and 
one can choose for R the smallest possible value, the effective radius of the scatterer. The larger 
the value of / the larger one has to choose the value of R in order to be in the asymptotic region. 
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Using the relation* (VII.2.5): 


1 i 
Pp nO pp 


+ ind(p — p’), (2.16) 
we rewrite (2.15), noting that the contribution of the term containing 
O(p — p’) vanishes, since 

ip — p)T1 - e7ildi(p) _ dP} =i) 
We are left with : 


(2.17) 


7° 


‘ = ' Fert l 
first term = 41 — g2i(di(p)— dip Wei? p’)R - 


Now let p > p’. Expanding the exponential in the square brackets, 
we get 


2.18) 
Pp dp \ 


The integrals over the second and third terms in (2.14) are obtained 
similarly, with the result 


poem dé 
first term — 4[ —2i(6,(p) — 6,(p’)) Je”? 5 oa (P) 


pile + POR = 
second and third terms = (—1)!| i ————— - 3(e* 7") — 1) 
ae iy oe 
eet PR 


: 4(e7 2i6;)(p") __ 0] ‘ 


(2.19) 


= 
p+p —i0 


Here there is no problem with taking the limit p’ > p, and after 
elementary simplification one obtains the second term on the right- 
hand side of (2.13). | 


If we insert J of (2.13) into (2.10), perform the integration over t [letting 
the bound of the integral go to infinity and using (XIV.5.19)], and then inte- 
grate over E’, we obtain 


1 = 2) | p(E)dE CEll,|W"|Ell,> 


tls 

a Ee z 
pL ap 
Since the expression in the square brackets does not depend upon /,, we 
can sum over that component of the angular momentum. Denoting 


>, <Ell3|W'*| Ell,» = W(E) = Wp), (2.21) 


Is 


1): 7 cos(2pR + 6,(p)) sin 540). (2.20) 


* Gel’fand and Shilov (1964, Vol. 1). 
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which because of the normalization of W'" fulfills 


> | o(E) dE <Ell,|W*"| Ell) =. | p(E)dEW(E)=1, (2.22) 
I 


31 


one can write (2.20) as 
1 =, | p(B) de WES) (2.23) 
I 


where for a nonrelativistic particle with mass m, the quantities p, E and p 
are related by [cf. (XIV.5.11) and (XIV.5.18a] 


E=p*/2m, __ p(E) = mp(E), (2.24) 
and where we have defined 
1 [d6 —1)! 
(SME) = 2B) Ee E 2 cos(2pR + 5,(p)) sin 510) 


_ 4)! 
= 2m Ee = <2 -A(sin 2(:pR + 5,) — sin 20k), (2.25) 


The second term on the right-hand side of (2.25) is of limited variation 
and depends upon the radius R of the region &, which can be arbitrarily 
chosen, the only restriction being that it be greater than the range of the 
interaction.** We therefore average over R to obtain a quantity that does not 
depend upon the irrelevant exact choice of the radius R and is characteristic 
of the scattering process only. This average of t” is 


1 d5(p) _ 1 ddiv) _ , dd(E) 
p> dp p dp dE 


tp(E) = 2p(E) (2.26) 


The average time delay, which for short we again just call the time delay, 
is thus 


a= y { p(E) dE W(E)t!(E). (227) 


The incident beam W’" is usually prepared, not with a definite value of the 
orbital angular momentum /, but with a momentum pointing in a well- 
defined direction. W,E) then describes the weight of the /th angular- 
momentum component in the mixture W™. However, if one arranges that 
the incident “beam” has a well defined angular momentum /,, then one has 


pP(E)W(E) = 64, F KE — Ex), (2.28) 
where F, (E — E,) fulfills, because of (2.22), the condition 


fae F, (E — E,) = 1 


4a Note that for | # 0 R in the above derivation has to be chosen in the asymptotic region. 
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and describes the energy distribution in the “beam” [cf. XIV.5.35)]. The 
incident beam is usually not monoenergetic. However, if it can be considered 
monoenergetic, then [cf. XIV.5.36)] 


F, (E — E,) = o(E — E,). (2.29) 


Inserting (2.29) and (2.28) into (2.27), one obtains for the time delay of this 
particular “state” with angular momentum |, and energy E, 


ty = ty(E,). (2.30) 


Thus the quantity (2.25) or (2.26) is the time delay occurring in the scat- 
tering of a state with angular momentum / and energy E. And the time 
delay in the state W'" (2.27) is the weighted average of the time delays 
t!,(E) over all angular-momentum and energy values. 

Restoring the dependence upon the additional quantum numbers 4, 
the time delay of a “state” with quantum numbers y/E is given by: 

cE) = OULD _ 2m dB) 
dE p dp 

A monoenergetic beam describes a steady state for which time delay 
really has no observable meaning. For a measurement of the time delay 
one needs a beam of pulses of finite duration and, therefore, of nonzero energy 
spread.° Then the quantity tp of (2.27) and also the time delay for a particular 
angular-momentum value, 


(2.31) 


doj(E) 
dE ’ 


nS fae F(E — E,)t(E) = 2 [ae F(E — E,) (2:32) 
can —at least in a Gedanken experiment--be measured as the difference in 
time that a pulse of projectiles needs to pass through a region @ with and 
without the target present. Scattering experiments are, however, usually 
not carried out in a way that allows the measurement of such a delay, and 
the significance of (2.31) or (2.26) lies in its theoretical consequences, which 
we shall discuss now. 

If during the scattering process for a particular set of values (m/E) = 
(nrlpEp) a quasistationary state is formed, that means, if the projectile is 
temporarily captured by the target, then for this set of values the delay time 
t',(E) must be large. Thus the condition for a quasistationary state is 


doj(E) 


IE has a sharp positive maximum at (7p/p Ep), (2.33) 
which in particular means that 
d?54"(E) aP5(E)| . 
is ae = 0, aa is large. (2.34) 


> Cf. the discussion in Section II.11. 
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The main contribution in the expression (2.27) for the delay time then 
comes from the term with these particular values (I; E,). 

It may, of course, happen that for a particular scattering system all 57(E) 
are such functions of E that the d5#(E)/dE do not have any maxima, and 
therefore no quasistationary state exists. However, more often than nof at 
least some of the d6?(E)/dE do have such isolated maxima, and we shall 


investigate the consequences of such an occurrence below, in particular in 
Section XVIII.6. 


XVIII.3 Causality Conditions 


The time T = T™ which a projectile spends within a certain region @ of 
radius R around the target must always be positive. Thus the condition 


S20) 


is obviously fulfilled. This condition is, through the definition of T in (2.2), 
connected with the von Kampen causality condition, 


{ d3x <x|W(t)|x> =O at any time ¢, (3.1) 
_ 


which is also obviously fulfilled as it represents the probability to find the 
projectile in the region &. The strongest form of the condition (3.1) is obtained 
if one chooses for R its smallest possible value, i.¢., the effective radius of the 
scatterer. 

The time T‘®? that the projectile spends in the interaction region can be 
obtained according to (2.4) from 


TA) = A) 4 Tit, (3.2) 


where ¢{ is given by (2.23) and (2.25) and Ti" is given by 
or les! , 
TIM) — | dt Y i p(E) dE p(E’) dE’ e~*®- © El,| W*"| E'll,)(—J*) 


fee) Ul 
Ga) 
with Ji" given by 


R 2 rR 
(-J) = [Pdr IED rlEDY =“ [ Paritoni@n. G4) 
fy) 0 


Equation (3.3) with (3.4) is obtained by the same arguments that led from 
(2.4) to (2.10) with (2.11), except that in (3.4) we did not have to replace the 
integral over the interaction region # by an integral over («70 — -#). 
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The integral in (3.4) is calculated using (XVII.3.9b) and (XVIL3.7a), the 
result being 


2 R? 
— Ji") = 
( ) wp + ppp} 


hs is ( : sin( n—8)sin (oe i ”) 
~n(ptP)p—p)\pR, 2 2 


1. TL) eee ml 
oR sin (re — =e) sin (> R- 5 }): (3.5) 


For the second equality in (3.5) the asymptotic form (XVII.3.6a) was used, 
which is justified for R in the asymptotic region or in case | = 0 for R just 
outside the interaction region. Using well-known relations between trigono- 
metric functions, this becomes 


(p'7(PRyi- i(p'R) — Pji+1(PR)ip'R)) 


| eee eee (p — p')R cD sin(p + p')R (3.6) 
T pp’ = p’ Pp - p’ os 
and for p' > p 
. 1 (-1)'. 
—J"™=—,[(R-—- : ; 
mi ( > sin 20R (Se) 


This expression is inserted into (3.3) after one has performed in (3.3) the 
integration over t using f2% e#* dt = 276(x) and over E’ which leads to 


2R (1) 
Tit@) — >) 0(E) dE <Ell;| Wi" | Ell,> (= = ( 5" sin 2pR). (3.8) 


U3 


With the definition (2.21) and (2.24) one obtains 


TA — Y foe dE W(E)T™®(E) (3.9) 
1 
where 
¢ 2R (14 
Tina E = a : 2 . ; 
(E) nD OEY: sin nER) (3.10) 


Equations (3.9) and (3.10) have been written in complete analogy to (2.23) 
and (2.25). The quantity T'*"(E) is thus the time which a “state” with 
angular momentum / and energy E spends in the region # if no interaction 
takes place. The time T™ is in analogy to (3.9) and (2.23) also written as 


TH =Y [oe dE W(E)T"(E), (3.11) 
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where according to (3.2) 
Ty = 1B) Et ne), (3.12) 


Inserting (2.25) and (3.10) in (3.12) and using some elementary relations 
between the trigonometric functions, one obtains 


T(E) = me & + k) —(-1) ie sin 2(pR + 6(p)) — (3.13) 
p \dp p 
T(E) is the time that the “part” of the projectile with energy E and angular 
momentum / spends in the interaction region. 
The condition (3.1) must be fulfilled for any arbitrary state of the projectile, 
1e., for any set of positive functions W,E). Consequently we conclude 
from (3.11) and (3.1) that 


T(E) >0 for alll and E. (3.14) 
This condition (3.14) can be rewritten using (3.13): 
do, sin 2(6, + pR) 1 
> —R 1)’ —————" > — —|}. : 
i + (—1) >p = R+ > (3.15) 


In the form (3.15) the condition is called Wigner’s causality inequality. The 
smaller R the stronger is the condition (3.15); but only for | = 0 can one 
choose in our above derivation for R the effective radius of the scatterer. 

Wigner’s causal inequality states that the phase shift cannot decrease faster 
than at a certain rate. Thus if the phase changes rapidly, then it must be 
increasing. We shall make use of this fact below. 

The condition (3.1) is so obviously fulfilled that one may wonder why the 
conditions (3.1) and (3.15) were given the name causality conditions. The 
connection with causality becomes most apparent if one rewrites (3.15) into 
yet another form using (3.12) with (3.10): 


tp P(E) > — TIME) = -|"> ee ee m sin an 7. (3.16) 
The first term on the right-hand side and on the mies side of (3.10) 
is the free time of flight that a classical projectile with velocity m/p would 
need to traverse the region of diameter 2R. The second term represents 
the quantum effects; it is negligible for sufficiently fast particles, pR > 1, 
and arises from the fact that quantum particles cannot be localized within 
dimensions smaller than a de Broglie wavelength. 

If the region of diameter 2R is the interaction region in which the projectile 
interacts with the target, then—depending upon the kind of interaction—the 
projectile may be delayed (tp > 0) or hastened (tp < 0) in it. It may be 
delayed an arbitrarily long time, and we shall discuss the phenomena 
connected with it in the subsequent sections. However, it cannot be advanced 
an arbitrarily large time, because according to causality outgoing projectiles 
cannot appear before the incoming projectiles have reached the scattering 
region. Classically, the maximal time advance occurs if the projectile has 
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hardly entered the interaction region when it is repelled at the surface, and 
leaves the interaction region @ at a time m2R/p earlier than a particle that 
passes through such a region without interaction—i.e., its time delay is 
—m2R/p. A more negative time delay is not possible, because then the 
outgoing projectile would have to leave the surface of the interaction region 
before the incoming projectile had reached it. For quantum systems these 
arguments have to be corrected by the term (m/p)(sin 2pR)/p, and then lead 
to (3.16) as a mathematical formulation of the above statement of causality. 


. 


XVIII.4 Causality and Analyticity 


In Chapter XVI we considered the /th partial T-matrix elements 
<n, T(E)|ing> and the Ith partial S-matrix elements <n,||SE)\\n4>; in 
particular we considered the /th partial T- and S- matrix elements for elastic 
scattering, T(E) = T,(p) and SE) = S{p), where the momentum p and 
energy E of the projectile of mass m are connected by E = p?/2m. If the 
interaction is described by a Hamiltonian H = K + V, then T and S are 
connected to the interaction Hamiltonian V by expressions like (XI'V.5.41), 
but as mentioned above (e.g., in Section XIV.4), the notions of S- and T- 
matrix make sense also if there exists no Hamiltonian time development. 
Then the S- or T-matrix is the ultimate basis for the description of all the 
information required to compute observable quantities, such as cross sections 
[(XVI.1.26), (XVI.1.29), (XVI.1.30)]. The properties of the S-matrix then 
follow from general physical principles believed to be satisfied by any inter- 
action. One kind of general physical principles are symmetry principles, 
and we have, in fact, already made use of such a symmetry principle, namely 
the rotational symmetry (XVI.1.1), when we derived that the S-matrix 
elements are independent of /,. The general physical principle that we shall 
utilize now is the causality condition discussed in the preceding section. 
The causality condition leads to analytic properties of the S-matrix elements 
when the energy and momentum of the projectile are extended to complex 
values.© When the interaction is described in terms of a potential function 
V(r), then this kind of analyticity of the S-matrix can be derived from the 
properties of the potential function.’ Following the spirit of the presentation 
in this book, we shall not start from a potential function, but establish some 
analyticity of Sp) as a consequence of causality. We shall not give an exten- 
sive discussion of this subject, especially as not all questions on this subject 
have been answered yet, and content ourselves in this section with the formu- 
lation of the analyticity properties of Sp) that will be needed in the sub- 
sequent sections in connection with resonance phenomena. 


© One obtains dispersion relations when these analytic properties are expressed in terms of 
integral relations between different matrix elements for real values of the variables. 


7 Taylor (1972, Chapter 12). 
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The probability of finding the projectile at an arbitrary time t anywhere in 
space is unity: 


1 = | d?x <x|W(t)|x> = | d3x <x|W(t)|x> + il d°x <x|W(t)|x). 
— R (0% — #) 
(4.1) 


Consequently the van Kampen causality condition (3.1) can be reformulated 
to read 


d°x <x|W(t)|x> <1 at any time t. (4.2) 
(0 — #) 


Inserting a complete system of basis vectors | Ell} > and using (XIV.5.21) 
and (2.9a,b,c), one obtains for (4.2) [by the same calculation that led from 
(2.4) to (2.7) except for the change from the integral over & to the integral 
over (co — &)]. 


yi [ | o(E) dE p(E’) dE’ e~*E-) Ell,| W"| E'lls) 
lls 


x CE > CE |p dr = 1. G3) 
=.# 


r 


In the region (« — &), where no interaction takes place and the projectiles 
move freely, the wave function <r|E*>, = <r|p~ >, has the general asymptotic 
form (XVII.4.17), in which the effect of the interaction is expressed in terms 
of the S-matrix S,(p) = e?!): 


yaaen | 
ee foe (pier —ini2g — eg ilpr—tx/2)y 44 
<r|p"> iE Dipr (e (p) — e ) (4.4) 
Inserting this into the integral over r in (4.3), one obtains for R from the 


asymptotic region 


aol 6] 
integral = i r? dr O(r — R)<r|p* <r |p" >t 


io 9) 


+0 
=i eter {SulPrstlr) | par = Rye esa 
m 4pp ae 


+ 00 
: | drO (r — R)eW i Pv6-®) 


+0 
—¢e "*S,2(p) | dr O(r — Ryell? Pe -®) 


+o 
—e'*S*(p’) | dr Wr — Bere, 


416 Resonance Phenomena 


where we have defined® 


Sir(p) = e7!PRS (p). (4.5) 
The Fourier transform of the @-function has been calculated in (XV.2.25+ ); 
using that result one obtains 


—i(p— p)R 


2 
integral = 7 ae {Susi 


l 
p—p +i" p—p+i0 


as -0'(s1 Oe a Su) +a) 


Using the relation between the distributions, (2.16) = (VIII.2.5), this can 
be rewritten 


1 
; —i(p— p')R 
integral = ahd 


+ (—D.Ci@) = Sia(P)) 17 


+ n6(p — P’)(Siz(p)Sia(p’) + YD 
+ m—1)'5(p + p’)(Sie(p’) + * 
Inserting this integral into (4.3), one obtains, using p dE = p* dp, 


> [ ie Pp cB ce Gee = pil, 
Us 


e iip- p’)R 


« {a — Sir(P)Sii RP) 5 RP) — Sir(P)) — 


mO(p — p')(Sig(p)Six(p') + 1) — n(—1)'(p + p’)(Si(y’) + SuiPD} <1 


i(p + p’) 


(4.6) 


Performing the integral over p and p’ in the last term gives a zero contribution 
because of o(p + p’). After integrating the next to the last term over p’ and 


8 The physical meaning of the exponential factor e~ 7'?* in 
S,(p) = eee = e2(— PRIS, (p) 


is that it represents the phase advance corresponding to a path difference from the surface of the 
interaction region to the center and back, According to the description of Figure XVII.4.1, 
a repulsive potential gives a negative phase shift. An impenetrable sphere of radius R has a 
phase shift of —pR, because the wave is reflected at the surface of the sphere, as compared with 
the one going through the center. Correspondingly an outgoing signal can appear up to 2Rr = 
2Rm/p earlier than would have been possible in the absence of the scatterer. S,p(p) then describes 
the deviation of the effect of the scatterer from the effect of an impenetrable sphere with the 
radius of the interaction region. 
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using the unitary condition S,,p(p)S#(p) = 1, one obtains the contribution 


3 dp p’<pll,|W'"| pll,> =$-2=1. 


Us 


For t = 0 the above inequality (4.6) therefore becomes 


» i} { p> dp p’* dp’ <pll,;|W*"|p'll;> : eo tp PIR 
lz, 40 40 2npp 
1 
S.0)S0) — 1) 
i(p — p’') 
oe [ dp p’”* dp'<plls |W'"|p'lls> pe PB 1)! 
as 
x (Sik(P') — Sir(p)) (4.7) 


i(p + aa 
As this has to be fulfilled for any state W'", it follows that a corresponding 
inequality must hold for any term in the sum )°,,, separably and for any 
physically permitted momentum distribution function. 

One can write (4.7) in a different form by introducing for Wi" the 


momentum wave function A,?(p) [in analogy to the f(p — p’) of (IL.11.1p)] 
by 


Ag (p)AR(p’) = pe '?*<13 Ip| W'"|p'll,e'? Xp’. (4.8) 


Thus one obtains from (4.7) 


1 
ls TE ee 
[. ie dp dp’ Ax*(p)AR(P’)(Sin(P)Sik(P) — a 


=|. [ dp dp’ (— 1 Al(p)AB(p')(Se(p') — Sia(p)) = (4.9) 
i(p ay 


for every value of / and |, and for any well-behaved function (ie., for any 
element of the Schwartz space), Az°(p). 

From the form (4.9) of the van Kampen causal inequality it follows, by a 
purely technical proof that we shall not reproduce here:? 


S(p) has an analytic continuation without singularities in the first 
quadrant 0 < arg p < n/2 — 6(6 > 0) of the complex p plane. (4.10) 


In fact, one can prove more about the analyticity property of S(p) than just 
this, and in Section XVIII.5 below we shall summarize some results. It is, 
however, only (4.10) that we shall need for our derivation of the Breit- Wigner 


°N. G. van Kampen, Phys. Rer. 91, 1267 (1953), Section II [for odd / all statements are for 
—S,(p)]. 


418 Resonance Phenomena 


formula in Section XVIII.6. We wish to emphasize that (4.10) holds for 
any value that one chooses for R, as long as it is finite. 


XVIII.5 Brief Description of the Analyticity Properties of the 
S-Matrix 


Though a derivation of the following result on the analyticity properties of 
Sp) exceeds the scope of this hook, and we shall also not make explicit 
use of these properties in what follows, the knowledge of these properties 
will greatly enhance the picture of the S-matrix. We shall, therefore, state 
these analyticity properties here with a few connecting remarks.!° These 
properties are independent of the specific form of the interaction and follow 
from very general principles—essentially from the causality condition in 
the form (4.9), 

In order to obtain the properties of Sp) beyond the first quadrant, we 
have to continue S,(p) across the imaginary axis and into the lower half 
plane. In order to do the first we have to know the properties of S,(p) on the 
positive imaginary axis. We know already from the discussions in Section 
XVIL.5 that bound states correspond to poles on the positive imaginary axis, 
so that we cannot expect S,(p) to be analytic there. Figure 5.1 shows these poles 
and zeros, which are, of course, also possible on the real axis. 

Asa further consequence of the causality condition (4.9) one can prove that 

Re p : 
Im S,2(p) < (— 1)! re ImS,(p) <1 in Oisdanepeei2 G1) 


and 
| Sir(p)| is bounded in the first quadrant.” (5.2) 


This is not sufficient to obtain information about the nature of the singular- 
ities on the imaginary axis, except to conclude that if these are poles, they 
cannot be of higher than the first order. 


10 For more on this subject see Nussenzveig (1972). 


'l This is a very weak condition for large values of R. because it allows S, (p) to vary rapidly 
on the real axis also in the absence of any poles or zeros. But in the above derivation of the 
causality condition (4.9) R has to be chosen from the asymptotic region because (4.4) is the 
asymptotic form for the radial wave function outside the interaction region. The smaller the 
value of /, the smaller can one choose the value of r for which the asymptotic form (4.4) is valid. 
But only for the case /| = 0 is (4.4) identical with the radial wave function outside the inter- 
action region. Therefore, only for / = 0 can one choose for R the smallest possible value namely 
the effective radius of the scatterer. In order to obtain the strongest result, i.e. the above state- 
ments for R equal to the effective radius of the scatterer, also for / > 0, one has to take instead 
of the asymptotic form (4.4) the exact expression for the radial wave function for r outside the 
interaction region: 


<rip*>, = /4n(S(p)h(pr) + h¥(pr)) (4.4) 


where h,(pr) and hj(pr) are the spherical Hankel functions (XVII.2.5). The mathematical proof 
that (5.1) and (5.2) follow also if one takes (4.4’) instead of (4.4) has not yet been given. 
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In order to conclude anything more, one has to make an assumption in 
addition to the causality condition. If the interaction is described by a 
Hamiltonian H, this assumption amounts to the requirement that H be a 
semibounded operator, i.e., there is a lowest energy eigenvalue B and the 
binding energy cannot be infinite. In the case where one does not have a 
Hamiltonian time development and the bound states are described by 
singularities on the imaginary p-axis, one would assume this to translate 
into the statement that there are no singularities of S$,(p) on the imaginary 
p-axis above the value iK, where K = +./2mB. 

It has in fact been shown’ that from a precise formulation of the finiteness 
of the binding energy it follows that 


Rep . 
Im S,,(p) > —(—1)! ae in the first quadrant, (5.3) 


which together with (5.1) gives 
Im S;r(p) > 0 for Rep—-0,, (5.4) 
so that 
S,(p) is real on the positive imaginary axis except between 0 andiK. (5.5) 


For scattering states the physical values of the momentum p must be 
positive; however, p occurs quadratically in the Schrédinger equation and 
also in the Lippman-Schwinger equation. Consequently the equations are 
invariant with respect to the change of sign of p, and therefore, if p is replaced 
by —p in the radial wave function, the resulting function must again be a 
solution of these equations. Replacing p by — p in (XVII.4.17d) gives 


us + Cea I 1 L—ipr = ipr 
fio me mame)! Ne ae (Peo): 


(5.6) 


As <r|p* > and <r| —p* > describe the same physical content, the right-hand 
side of (X VII.4.17d) and (5.6) must both describe an incoming spherical wave 
and an outgoing spherical wave modified by the scattering matrix. Therefore 
one requires that for negative values of p the § matrix element S, be given by 


Sy “(—p) = Sip) (5.7a) 
Or 
S(—p) = S; *(p). (5.7b) 
Using unitarity, 
S(P)ST(p) = 1, (5.8) 


12 N. G. van Kampen, Physica (Utrecht) 20, 115 (1954). 
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x x 
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Bound state 
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Figure 5.1 Some poles (indicated by x) and zeros (indicated by 0) of the 
S-matrix element in the p-plane. 


one obtains 
S—p) = Sf(p) for real p. (5.7c) 
This is called the symmetry relation for the Ith partial S-matrix element. 

The /th partial S-matrix element is usually considered as a function of 
E = p?/2m, S(E). In the mapping from p to E the complex p-plane (see 
Figure 5.1) is mapped onto a two-sheeted Riemann surface [cf. XVII.Ai 
and XVII.Aj] with the branch cut from 0 to oo. The upper half p-plane 
Im p > 0 corresponds to the first sheet; the first quadrant of the p-plane 
corresponds to the upper half of the first sheet. The physical meaningful 
values of p in scattering states, p real, p > 0, correspond to the upper rim 
of the first sheet. The first sheet is called the “physical” sheet (Figure 5.2). If 
one continues through the cut one comes to the second sheet, called the 
“unphysical” sheet, which corresponds to the lower half of the p-plane. 
SE) is a function onsthe two-sheeted Riemann surface. The bound-state 


E-plane (physical sheet) 


Bound state poles 


Figure 5.2 Poles and cut in the first (“physical”) sheet of the energy 
plane. 
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Figure 5.3. Relation between zeros (indicated by ©) and poles. 


poles of S,(p) on the positive imaginary p-axis correspond to poles on the 
negative real E-axis. Statement (4.10) then means that 


SE) is analytic in the upper half plane of the physical sheet. _—_ (5.9) 
The relation (5.7c) for S, considered as function of E is then stated as 
S(E — ie) = S#(E + ie), E > 0. (5.10) 


From the Schwarz reflection principle (XVII.Ah) it then follows that S,(E) 
can be continued into the lower half energy plane and is given there by 


S(E) = S*(E*) (5.11) 


Since the lower half energy plane corresponds to the second quadrant of 
the momentum plane and E* corresponds to — p*, (5.11) leads to 

Si(p) = Si(—p*) (5.12) 
as an extension of the symmetry relation (5.7c). Thus, at points symmetrically 
placed with respect to the imaginary axis, S,(p) takes on complex conjugate 
values. As it has no singularity in the first quadrant, it has also no singularity 
in the second quadrant. If S,(p) has a zero at p,, as shown in Figure 5.3, then 
it will also have a zero at — pj. 

One can now continue S,p) into the lower half p-plane (Figure 5.3), by 
extending the relation (5.7a) to complex values of p. For every value in the 
second quadrant (5.7a) defines a value of S,(p) in the fourth quadrant, except 
at zeros in the second quadrant which produce poles in the fourth quadrant. 
In the same way (5.7a) defines a function analytic in the third quadrant 
except for poles coming from zeros in the first quadrant. Thus, S,(p) is in 
general not analytic in the lower half plane and if it has zeros above the positive 
p-axis it will have poles below the p-axis, the resonance poles of Figure 5.1 and 
their counterparts in the third quadrant. 

In this way we have seen that S,(p) is a meromorphic function in the whole 
p-plane that is analytic in the upper half p-plane except perhaps on the 
imaginary axis. The poles in the lower half plane occur in pairs, symmetrically 
distributed with respect to the negative imaginary axis, except for those on 


422 Resonance Phenomena 


E-plane (unphysical sheet) 


cut 
X X 
Virtual state poles Resonance poles ; 


Figure 5.4 Positions of possible resonance poles, virtual-state poles and 
cuts (capture state poles not shown) in the second (“unphysical”) sheet 
of the energy-plane. 


the negative imaginary axis, corresponding to zeros on the positive imaginary 
axis. [The number of poles can be infinite, but they cannot have an accumu- 
lation point for finite values of p, as an analytic function (in the upper half 
plane) cannot have a finite accumulation point of zeros.] The singularities 
on the imaginary axis can be shown to lie within the interval (—iK, +iK), 
where B = K?/2m is the maximal binding energy. Nothing can be derived 
from causality about the behavior of S,(p) in the neighborhood of this interval. 
It is, however, believed that for an interaction that is effective only in a 
region of finite size, one has poles, which must then be simple.'* 

If we translate these properties of S(p) into statements about SE) in the 
energy plane, we can summarize our results, which are principally a con- 
sequence of the causality condition, in the following way (Figures 5.2 and 5.4): 
S,(E) is a meromorphic function on the two-sheeted Riemann surface with a 
branch point at E = 0 and a cut from 0 to oo. The physical values of E in 
collision processes lie on the upper edge of the cut on the “physical sheet.” 
Bound-state poles lie on the negative real axis of the physical sheet, and 
except for these bound-state poles S,(E) is an analytic function on the physical 
sheet. Further poles (of any order) may lie on the second, “unphysical” 
sheet, coming from possible zeros on the first sheet. Poles at various locations 
on the “unphysical” sheet have various physical interpretations. Poles on 
the negative real axis of the unphysical sheet [coming from zeros of S,(p) 
on the positive imaginary axis] are called virtual-state poles. A virtual state 
is one that would be bound if the interaction were more attractive and a 


'3 Tf the interaction is described by a potential V(r) that decreases faster than any exponential 
for r + x», then $,(p) is meromorphic in the whole p-plane and analytic in the upper half plane 
except for a finite number of simple poles. On the other hand, for a Yukawa-type potential 
Vir) = JX o(we~“"/r du, m > 0, one has, in addition to possible poles of S,(p), also cuts along 
the imaginary axis from im/2 to ico and from —im/2 to —ioo. 
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virtual state close to threshold causes a large cross section at low energy; 
we will discuss such states in Section XVIII.7. Poles of SE) on the un- 
physical sheet, if they are close to the positive real axis, are of particular 
importance. They are called resonance poles or Siegert poles, and we shall 
study them in the remainder of this book. Each pole in the second sheet 
below the real axis has a counterpart in the second sheet above the real axis 
(capture states). 


XVIII.6 Resonance Scattering—Breit-Wigner Formula for 
Elastic Scattering 


Experimentally, resonances are usually associated with a sharp variation of 
the cross section as a function of energy. If the elastic or inelastic cross sections 
exhibit sharp maxima or minima, one says that a resonance has occurred. 
From (XVI.2.9) for the elastic Ith partial cross section one sees that a maxi- 
mum occurs either at the energies for which 6,(E) = 1/2 (modulo z) or for 
which 6,(£) has a maximum. The latter possibility, however, cannot lead to 
a sharp maximum, because according to the causality condition (3.15), the 
phase shift can never decrease with a steep slope. Thus a sharp maximum in 
the /th partial cross section occurs at those value of E for which 6,(E) = 1/2 
(modulo z); and the more rapidly 6,(E) increases by z, the sharper is the 
maximum. Similarly, a minimum for the elastic /th partial cross section 
occurs for 6(E) = x (modulo z). The connection between the change in 
cross sections and the change in phase shifts for these two cases and for some 
intermediate cases is shown in Figure 6.1. Note that as a consequence of the 
causality condition (3.15), 6(E) must be mainly increasing; if it decreases, 
this decrease can be only very slowly. In every case a sharp structure in the 
cross section o,(E) is connected with a sharp increase of the phase shift 
OE) by z. 

We therefore want to take as the preliminary definition of a resonance of 
angular momentum | at the energy Eg that 


6,(E) increases rapidly by approximately 2 when E passes through Ey. (6.1) 


If there is a sharp increase of one 6, by x around Eo, and if all other phase 
shifts are constant or slowly varying around E,, then rapid variations will 
also show up in the total cross section ¢ = )'o, or the differential cross 
section at certain angles. However, not all structures in the cross section 
should be ascribed to resonances. Resonances can usually be distinguished 
from nonresonant phenomena in the cross sections by the fact that resonances 
appear only in the single partial wave, while nonresonant phenomena are 
the result of cooperative contributions from many partial waves. 

We now want to explain the association of resonances and quasistationary 
states that we have already alluded to in the introduction (Section XVIII.1). 
According to our considerations in Section XVIII.2, a quasistationary 
state occurs at those values of E = E, for which dd,(E)/dE has a sharp 
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Figure 6.1 Phase shifts and resonance profiles. 


maximum. We therefore take as the preliminary definition of a quasistationary 
state of angular momentum / at the energy E, that 


d?5,(E) 9 Pa) co. 1 {aor d35,(E) 
Hee dE lee,» ES (GE dE? 


Whereas the first two conditions of (6.2) state that Ey is a maximum of 
dé, E)/dE, the last condition of (6.2) states that this maximum is to be very 
sharp. The first part of the conditions (6.2) is already incorporated in the 
drawings of the phase shifts in Figure 6.1, where the point E, around which 
6(E) changes rapidly is a point of inflection. However, it is not clear that 
the two phenomena characterized by (6.1) and (6.2) are related. We shall 
show in this section that as a consequence of very general physical as- 
sumptions (causality), these two phenomena are indeed connected: that a 
quasistationary state leads to a resonance and that a resonance leads to a 
quasistationary state. We will see that (6.2) leads to a pole in the /th partial 
S-matrix below and near to the positive real E-axis. And we shall obtain 
from (6.2) a representation of S,(E), valid in the neighborhood of Ey, from 


(6.2) 


Eo Eo 
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which (6.1) follows immediately.'* The notion of resonance poles was 
introduced in Section 5, when we described the analyticity property of the 
S-matrix without, however, giving a complete derivation. We concluded 
there that there may exist some resonance poles. Here we show that they 
appear as a consequence of the existence of a quasistationary state, (6.2), and 
provide therewith a physical explanation of their existence. 

To start our derivation we introduce the function 


f(E) = cone 


(6.3) 


which as a consequence of (6.2) has a sharp minimum at E = Eo, so that 
f' = f(E)e=e, = 0 and f” = f"(E)g-¢, is large. More precisely, a simple 
calculation shows that 


y dé ear as) 
a (Fz €)) 7E3 (Eo) (6.4) 
and consequently, 
2 do, a sie 
= <1 6.5 


from the last two inequalities (6.2). 
We want to expand f(E) in a Taylor series around E = Ey: 


s(@) = 5+ 5 — By)? + FE ~ Bo) (66) 

where the “remainder” term is 
(ba - f' (n) ; é ; 
BCE — Eo) =~, (E — Bo)" + ao Egle =: (6.7) 


The nth derivative of f at the value E = Eo, f = d"f(E)/dE"|;.~£, can 
easily be calculated to be 


f@ = ——. where 6" = — (6.8) 


The Taylor series, therefore, converges if for all n larger than a certain value N 


n 
(E — Eo)’ 


5M D << 5m (6.9) 
certainly not too stringent a condition. 

We shall first assume that F(E — E,) is negligibly small [F(E — Ey) = 0] 
in a certain energy range around E,; this would be the case if 6{* and the 
higher derivatives of the phase shifts at Ey could be neglected. In a second 
step we will discuss the corrections that arise from a small F(E — Eo). 


14 This derivation is based on Goldberger and Watson (1964, Section 8.5). 
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With F(E — E,) = 0 it follows from (6.3) and (6.6) that 
dE 
f + f"/XE — Eo)’ 


= ie arctan (" a _ +; 
Se Jip 


6(E) = 


= Fi arccoe —————— aus ES EE Ores (6.10’) 
Be 
6(E) = | 7a arctan a), + V1 (6.10) 


where y; and y, are arbitrary integration constants, y, — y; = 7/2. Equation 
(6.10) suggests the introduction of the two new parameters: 


2(6()3 
r= Fr ff 7 = ( a (6.11) 
2 [ 2690) 
= 2 Pe i om (6.12) 


r is dimensionless, and T has the dimension of energy. As a consequence of 
(6.5) (which followed from the requirement that there exist a quasistationary 
state at E,), I fulfills the condition 


2b (6.13) 


Inserting (6.11) and (6.12) into (6.10), one obtains for the /th partial S- 
matrix element 


SE) = ei ZOE) = ei2rarctan((I'/2)/Eo— E) i291 (6.14) 


The phase shift (6.10) is written as a sum of two terms: 


5(E) = OE) + v1, (6.15) 
where 6(*)(E) is a rapidly varying function due to (6.13): 
2 


6®)(E) = r arctan (6.16) 


E, —E’ 
whose value changes by almost rz when E passes through Eo. 7; 1s a constant 
under the assumption F(E — Ej) = 0. 

If F(E — Ej) is not zero, then one obtains instead of (6.10’): 


dE dE 
(E) Feycrusen cocoa f+ f"PUE — Eo? 
_ | eee dE FE _ Eo) a 7 (6.17 
(fe fe SEL ee ee: 72 
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Writing 6,(E) again in the form (6.15), we obtain for y, 
_ dE F(E — Eo) 
ey 2 Ef as ee Ey + FF) 


which for small F(E — E,) is a slowly varying function of E. 

Thus the phase shift in the neighborhood of a quasistationary state defined 
by the requirements (6.2) is the sum of a term 6{®)(E), which changes rapidly 
by almost zr, and a slowly varying term 6,(E): 


5(E) = 5{(E) + »(E), (6.19) 


e.g., as depicted in Figure 6.2. The quantity y,E) is called the background 
phase shift, or also the potential part !> of the phase shift, as opposed to 6)'(E), 
which is called the resonant part. Thus, if r = 1, we have derived the con- 
dition (6.1) as a consequence of the condition (6.2). In particular, if the 
background phase shift is zero, we have the situation depicted in Figure 6.1a. 

We should now, therefore, ask the question, what values the parameter r 
can take. In order to answer this question, we use the analyticity property 
(4:10).°° 

The function 


r/2 
aap 


has a branch point of infinite order at E = E, + iI /2 (and also one at 
E = E, —if/2).'’ If E moves from the real axis around this branch point 
and returns to the initial point on the real axis, always staying in the domain 
of analyticity, then In(E — Ey — iT/2) increases by 2mni, and 
arctan[(I'/2)/(E) — E)] by 2. Thus moving along this closed contour 
around the point E, + iI /2 results, according to (6.14), in the change 


Si) = Ss (byer™". (6.21) 


v1 = YE) = 


(6.18) 


arctan = 5 itn EF, hab) = in(E — E, — i1/2)] (6.20) 


On the other hand, since S,(E) is analytic in this domain, its value should 
not change. This is only possible if 


f= Ola... (6.22) 


We shall now discuss the simplest nontrivial case, r = 1. From (6.14) one 
then obtains 


Romer + (Den. iT . 
— toyr a ] Se Ba e yt 
SS ee Are ae 0/2 


= gi2ofPXE) i271, (6.23) 


1S Tf the interaction is described by a potential, then j, comes from the long-range component 
of this potential, whereas 6f comes from a short-range attractive component. 

© For the present discussion not all of the analyticity (4.10) is required; one just needs 
that SE) and, therewith, 7,(E) are analytic in a domain above the real positive energy axis 
that contains the point E = E, + iI’/2, a very weak condition indeed. 

‘7 Cf. Appendix XVIIIA. 
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Figure 6.2 Change of phase shift at a quasistationary state. 


and from (6.16), 


Ie/2 
tan 6R(£) = EE (6.24) 
The S-matrix element (6.23) is then written 
S(E) = 1 + 2ipT{E) = e? "(1 + 2ipT *E)), (6.25) 


where one has defined the resonant partial-wave amplitude T{"(E) in 
such a way that it agrees with the partial-wave amplitude T,(E) if the back- 
ground can be ignored (y, = 0). From comparison of (6.25) and (6.23) one 
then obtains for the resonant partial-wave amplitude 


1 WP) 


(R), ee ee ee 
Os p(Ey — E)— iT /2 


(6.26) 
Thus, if a quasistationary state with angular momentum / is formed at the 
energy E,, then the phase shift has the resonant behavior specified by (6.1), 
and—if the background can be neglected —the partial-wave amplitude is 
given for the simplest case r = 1 by (6.26). 

The /th partial cross section that one obtains from (6.26), which is therefore 
the pure resonance cross section (any background being neglected), is then 
given [using (XVI.1.38)] by 


(1/2)? 
(Ey — EY + (1/2) 


This function of E is shown in Figure 6.3 together with the corresponding 
resonance phase shift. We see that the resonance cross section has a maximum 
at Ey, and the full width at half maximum of this peak is I’. Ep is called the 
resonance energy, and I is called the resonance width. 

At resonance energy the cross section takes the value 


o®(E) = "2 Cle 1 (6.27) 


4 
off) = Zn (21 + 1), (6.28) 
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Figure 6.3 The Breit-Wigner cross section and its relation to the 
resonance phase shift. 


which is the maximum value for the elastic partial cross section compatible 
with unitarity, as seen from (XVI.2.9). 

The quantity I that has emerged in these discussions can be calculated 
from the phase shifts using (6.12) if these are known, e.g., from the interaction 
potential in potential scattering. However, as mentioned already in several 
places, the situation is more often than not the reverse: One knows the cross 
section from an experiment and notices that it shows, in the neighborhood of 
a certain value E,, the features shown in the lower part of Figure 6.1. Then 
one tries to fit this experimental cross section with a formula 


of? = 4n(21 + 1)| TRE) + T*(E)|’, (6.29) 


where T(E) is the function (6.26) and T°*(E) is a slowly varying background 
term. In this way one determines the values (Eo, I’). 

Equation (6.27) is the celebrated Breit Wigner formula; (6.26) is called the 
Breit- Wigner amplitude. It hardly ever gives a completely accurate descrip- 
tion of the experimental situation, which indeed one would not expect in 
view of the idealizations involved. It almost never fails to give a useful 
parametrization whenever resonance phenomena are involved. It 1s, perhaps, 
the most frequently used formula of quantum physics. Its fundamental 
significance comes from its introduction of the parameter I’: On the level of 
accuracy at which excited states are considered stationary, they are 
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characterized by one parameter,'® their energy value Eo; on the level of 
accuracy at which they are considered quasistationary states, they are 
characterized by two parameters, their energy und width(E,, 1). From Chapter 
Il we know that stationary states are described by eigenvectors of the energy 
operator with eigenvalue Ey; in Chapter XXI we will see that, analogously, 
quasistationary states are described by generalized eigenvectors of the 
[essentially selfadjoint] energy operator with eigenvalue Ey + iT’/2. 

Like every theoretical description, the description of a quasistationary 
state by a Breit- Wigner resonance (6.26) is an approximate description. If 
one wants a more accurate description, one can consider I not as a parameter 
but as a function of energy, Ff = T(E), which may depend upon one or more 
parameters (e.g., the radius of the interaction and the angular momentum !). 
We content ourselves here with the degree of accuracy with which the quasi- 
stationary state is characterized by the two parameters (Eo, I’). 

We now want to establish the connection between resonances and poles 
of S,. We have already introduced, in Section XVIII.5, the name resonance 
pole for a pole of S,(E) in the second sheet of the energy plane immediately 
below the positive real axis. As seen, e.g., from (6.23), the Breit-Wigner 
formula furnishes such a pole in the lower half plane at E = Ey — iI//2. 
The smaller the value of I, and consequently the sharper the resonance is, 
the closer this pole is to the real axis. We have already mentioned that 
there is no pole on the lower half of the first sheet of the energy plane.!° 
Consequently, this pole furnished by the Breit-Wigner formula must be on 
the second sheet of the plane, and thus there must be a pole of S,(p) in the 
fourth quadrant of the p-plane close to the real axis. 

On the other hand, every simple pole of Sp) at the point p = pr = 
kp — iKe in the fourth quadrant near the real axis (Kz < kg) may, but 
does not necessarily, lead to the resonance phenomenon. This is easily 
seen by expanding S,(p) around this pole in a Laurent series. In a sufficiently 
small neighborhood ofthe pole, 5,(p) may be approximated by the principal 
part of the Laurent series ~1/(p — pr). However, Sp) must be unitary 
[|5,(p)| = 1], so the factor multiplying 1/(p — pr) must be chosen to be 
p — px if the rest of the series is to be expressed as an exponential e'?”'. Thus in 
the neighborhood of the pole the expansion of Sp) compatible with unitarity 
is 

kp + ike 
P — Pr pkg thy 
Multiply denominator and numerator by p/m and note that 


=f a 
S(p) = e122) ww ei2ve) P ~ PR _ pianm P 


(6.30) 


f 2 k 2 k2 ee 2 Z 2 2 
FY ey Pe eae _ rR, rn) P _ kr KR 
m om m 2m 2m 2m 2m 2m 2m 

(6.31) 


8 In addition to their other quantum numbers like /, n. 

™ We did not give a complete proof of the statements in Section XVIII.S, but argued that 
causality, the finiteness of the binding energies for possible bound states, and the finite range of 
the interaction are sufficient, though certainly much less is necessary. 
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for real values of p that differ from kp by approximately Kp < ka, i.e., for 
values of p that fulfill |p — kp| ~ Kz. Then with 


2 
P 1 
re rep eee 12 
ay Ee = 5, (kh - KB), (6.32) 
one obtains 
E — Ep — iY /2 2Kerp 
SE) = e!27" R = oe ae 
DS a <> alae rumen) a Cee 
With the same accuracy as the approximate equality (6.31), 
2K rk 
Y(p) ¥ — = =T. (6.34) 


Then (6.33) is identical with the resonance formula (6.23), and consequently 
a pole of S,(p) at pg describes a resonance. 

The above considerations would suggest a one-to-one correspondence 
between poles of the S-matrix immediately below the positive real axis 
and a resonance phenomenon characterized by (6.1); however, they do not 
prove such a correspondence, because we have always assumed that the y, 
has the nice feature of a background phase shift. If y, varies rapidly without 
having the Breit-Wigner form (6.24), then it may compensate the effect of 
the pole, and the total phase shift 6, = 5{*) + y, may be slowly varying. It 
can, indeed, be shown’® that phase shifts }, can be constructed that approach 
the function (6.24) as close as one wishes in an interval Eg -A<E<E,+A, 
A > T, and that therefore describe a resonance phenomenon (6.1), but that 
are not associated with a singularity of e'*”. Thus choosing S, = e'??' gives 
an §-matrix with a resonance phenomenon but without a pole, and choosing 
y, = —), in (6.33) gives an S-matrix with a pole but without a resonance 
phenomenon. 

However, if the fourth and higher derivatives of 6, are small, then, as we 
saw in the first part of this section, y, will be slowly varying. Therefore, one 
generally assumes the one-to-one correspondence between poles immediately 
below the real axis and resonances. 

We also have seen above that a quasistationary state characterized by a 
sharp maximum in the time delay is a resonance. Further, as from (6.19), 
(6.24), and (2.26) it follows immediately that 
db(E) dd dy, _ T’/ _ Wt) 


dE pe Gh (eh, 2) + 17/4 ~ dé 
we see that a resonance leads to a sharp maximum in the time delay for the 
resonating partial wave. Thus a quasistationary state, a resonance, and a 
pole of S, immediately below the real axis are in fuct one and the same phe- 
nomenon. 


(6.35) 


atp(E) = 


20 L. Fonda, Fortschr. Phys. 20, 135 (1972). 
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We shall now discuss the connection between the width I’ and the time 
delay. According to the discussions in Section XVIII.2, (6.35) is the “time 
delay for a monoenergetic beam,” AE <« T. If a resonance occurs, then the 
derivative of the background phase shift is negligible, so that the resonant 
time delay is well approximated by 


do r/2 


dE © (E25) 4 (eo)? 


2tp(E) = 


The “time delay for a monoenergetic beam” that has energy E = Ep identical 
to the resonance energy is therefore 


4 
r 
This idealized time delay is not really a physical quantity, because in the 
case of a monoenergetic beam there is a steady state, as mentioned already 
in Section XVIII.2. 
In the opposite extreme, AE > T, the time delay is essentially determined 


by the experimental energy distribution of the beam, as we see by inserting 
(6.35) into (2.32): 


h= 2 (dE FE - Ede 


th(E)le=£, = (6.36) 


r/2 
PTE) 


(6.37) 


Noting that for AE > T one can take ’/2 — 0 in the integral, and using 


1/2 
(Ey — E)? + 17/4 


>nNO(E, — E) fort > 0, 


we obtain 
tl, = 2nF(Ey — E,). (6.38) 
For a Gaussian energy distribution of the incident beam,”' 
i 1 2 2 
F(E +> E,)=—=—— ¢ @ ee, (6.39) 


Jon AE 


one obtains at resonance energy 


1 
thle,=Eo = /2n KE’ (6.40) 
And for a Lorentzian energy distribution of the incident beam,7! 
| AE/2 
F(E—E,)= -=—s P 
(BE) = ie an: oe 


21 The normalization of F has to be chosen so that F(E — E,) > 6(E — E,) as AE>0 
according to (XIV.5.36). Note that F is the energy distribution of the incident noninteracting 
beam at t = 0, i.e., the beam has been prepared in the remote past in such a way that an energy 
measurement at t = 0 with no interaction present would give the result F(E — E,) for the 
probability distribution to measure E. 
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with AE > T, one obtains 


tole,=Eo = = (6.42) 

In general, if the energy distribution in the incident beam caused by the 
resolution of the apparatus has a width AE of the same order as the resonance 
width I’, one obtains a time delay that depends on I and on the experimental 
energy distribution. A particular situation arises when the experimental 
energy distribution is Lorentzian [Equation (6.41)] with an energy spread E 
that is equal to the resonance width AE = T. Then one obtains by inserting 
(6.41) into (6.37) 


tp = 2/[ (=2h/T in conventional units of time). (6.43) 


Thus we have seen that the time delay has something to do with the inverse 
of the resonance width. For an idealized monochromatic beam this relation is 
given by (6.36). But for more realistic situations the effect of the energy 
resolution of the incident beam will result in other connections between time 
delay and width—e.g., the one given in (6.43). 

So far we have considered for the parameter r, which we introduced in 
(6.10), only the value r = 1. However, according to (6.22) it can take any 
integer value. 

The case r = | seems to be the only case that is realized in quantum- 
scattering experiments, though some time ago there was thought to exist in 
particle physics some evidence for the existence of a “dipole,” which is a 
quasistationary state with r = 2. We shall discuss this case briefly here. 
With r = 2 we obtain from (6.14), ignoring the background (y, = 0), 


E, —E + iI/2\? 
= 6.44 
SKE) & —E- “| Oe 
and therewith for the /th partial cross section 

7 4n(21 + 1) T(E — Eo)’ 
(r= 2) = Z —— — 6.45 
i a (CR 1 

of-?) 
— 5 os Pes 5 


Figure 6.4 Illustration of the energy dependence of the scattering cross 
section (6.45) for a dipole resonance. 
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The shape of the cross section for such a dipole resonance is depicted in 
Figure 6.4. 

This is to be compared with Figure 6.1a. When background scattering is 
present, the cross section will be modified accordingly. 


XVIII.7 The Physical Effect of a Virtual State 


Besides the bound-state poles and the resonance poles, we mentioned in 
Section XVIIL.5 the virtual-state poles on the negative imaginary axis of the 
momentum plane. If a virtual-state pole is sufficiently close to the real axis, 
it can have an appreciable influence on the cross section at low energies. 
Denoting the momentum value of the virtual state pole by py = —ik, 
Kk > 0, we can write the | = 0 S-matrix element in the neighborhood of the 
pole as 
eye ag 
P—Py W-k 
This can be arrived at by the same arguments leading to (6.30), the back- 
ground phase shift having been chosen suitably. The Oth partial cross section 
is then 


(CAs) 


4nk> An 
p'(p? +k?) ? + pe 
In the limit of zero scattering energy, when the virtual pole is close to the 
real axis, this goes into 


Sy = 4n|T ol? = (2) 


69 2S (7.3) 


which is identical to the total cross section, since according to (XVII.5.6), 
higher partial waves can be ignored at low energies. Thus we see that for a 
virtual-state pole close to the real axis the scattering cross section can be 
extremely large. Also large is the scattering length, given according to 
(XVII.6.11) by 


ae ee (7.4) 

It is easy to see that a bound state with sufficiently low binding energy 
will also lead to (7.2) and (7.3) for the cross section. Therefore, a virtual 
state can only be detected in the low-energy scattering cross section if 
low-energy bound states are absent. 

Virtual states have been observed in nature. In the case of proton-neutron 
scattering there exists a bound state, the deuteron (isospin 0, angular mo- 
mentum 1). The scattering length is a!) = 5.4 x 10° '? cm. For angular 
momentum equal to zero (isospin 1), no bound state is present, but 
ai} — —23.7 x 107'* cm. Thus, there is a virtual state in the singlet 
(angular momentum 0) p-n system, whose position is closer to the threshold 
than the triple bound state and which dominates low-energy scattering. 
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Virtual states also cause long time delay. As we are at low energies, the 
phase shift can be obtained from (cf. XVII.6.15) 


1 
p cot 6o(p) = — —. (7.3) 
ao 
From this we obtain 
dbo _ —_ % 1 1 
dp epee KI pk 
Thus the average time delay (2.26), 
; m I i 
twit) =? a 4 
m pwl + p?/K?’ 2) 


can reach very high values. A better illustration of this effect is obtained if 
we compare the time delay with the time delay of a noninteracting classical 
particle in a region of radius R, which is given by (cf. Section XVIII.3) 
ri ™ 2R 
P 

Thus we see that the ratio of the time delay caused by a virtual state to the 
classical time delay is 

pvirt 1 1 0 1 

poe = - es, (7.7) 

a Rk 1 + p*/k R 1 + p*ap 


Depending upon the value of x and R, this ratio can be very high. E.g., for 
the p-n virtual state, taking R x 2.5 x 10° 3 cmanday ~ —24 x 10°'? cm, 
we see that the time delay caused by a virtual state is an order of magnitude 
larger than the classical time. 

A virtual state is thus not simply a fictitious, mathematical state, but a 
physical state in which projectile and target spend a considerable length of 
time together. 


XVIII.8 Argand Diagrams for Elastic Resonances and Phase- 
Shift Analysis”? 


The Argand diagram introduced in Section XVI.3 is a useful tool for the 
detection and display of resonances and the determination of their parameters 
(Eg, 1). To understand this we investigate in detail the behavior of the 
resonant scattering amplitude in the Argand diagram. We first study the 
idealized case of an elastic resonance without background. For this the 
scattering amplitude is given by (6.26), which we write in the form 


1 
pT/(E) = —, (8.1) 


22 G. Bialkowski has helped me with the writing of this section. 
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where we have introduced the abbreviation 
2 
6-4 be) r = cot d{®(E). (8.2) 


Figure 8.1 shows the Argand diagram for Equation (8.1). As we see, this 
elastic Breit-Wigner amplitude lies on the unitary circle. The top of the circle 
corresponds to the resonant energy ¢ = 0, at which value the phase shift 
6,(E,) is 2/2 (cf. Figure XVI.4.1). With increasing energy E, pTF moves in a 
counterclockwise sense—slowly along the lower part of the circle, and more 
rapidly as E gets closer to the resonance value Eg. This is indicated by the 
values for € on the path of pT? in Figure 8.1. 
To take the background into account we use (6.25) and obtain 


pT, = e” sin y, + e2* pT® = pT + e?"pT®, (8.3) 


where we have defined the background amplitude T’® in analogy to 
(XVI.2.8) by 


[, io 
TY) = . ees Ys = 55 (eo — ll), (8.4) 


From (8.3) we see that the effect of the background is to rotate the resonant 
amplitude by the angle 2y,. The quantity pT, for constant background ampli- 
tude is shown in Figure 8.2. The resonance circle starts at the point B and 
traverses the unitary circle, the only difference with the case of Figure 8.1 
being that now the resonance energy is not at the point where the resonance 
amplitude is pure imaginary, but at the point diametrically opposite B. If 
the background changes with energy, then the point of resonance is not well 
defined. 


_ ImPT@)] 


e = 0,(E = Ex) 


Be SNS e= +05 


Re[pTXp)] 


Figure 8.1 Argand diagram for the resonant elastic scattering 
amplitude: pT;(p) = (« — i)~* and cot 6}*) = «. 
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Im[pTAp)} 


Re[pT((p)] 


Figure 8.2 Argand diagram for the resonant elastic scattering amplitude 
in the presence of background: T; =7{*) + T(?®. 


As we have seen above, the resonance amplitude has a characteristic 
circular path in the Argand diagram, the resonance energy being the value 
where the phase shift changes most rapidly. As we shall see in Section XX.3, 
this feature will remain even when inelasticity is taken into account, only 
there the circles become smaller and distorted. 

The Argand diagram can be used to display the energy and width of a 
resonance for a particular partial wave. For this purpose one has just to 
draw pT, as function of energy in the Argand diagram. If pT, runs along a 
circle close to the unitary circle, one knows that this particular partial wave 
has an elastic resonance, and the energy value at which the amplitude has its 
most rapid change will be the resonance energy Eg. If there is no background, 
Ex will be the point at which pT, crosses the imaginary axis. In order to 
draw pT,(E) one has to determine it from the experimental data, and this is 
the main difficulty of phase-shift analysis. In the following we will give a 
brief description of this procedure. 

The data determined in an experiment are the cross sections: the total 
cross section at various momenta o(p) and the differential cross sections 
do/dQ(6, p) at various angles and momenta (or the differential cross sections 
da/d cos 0 = 2n da/dQ). According to (XVI.1.3) these cross sections are 
connected to the partial-wave amplitudes by 

PA 


do co 
Bena = 2k pee! + 1)P,(cos 0)T(p)| . (8.5) 


And by the optical theorem (XVI.2.20), 


a(p) = - ¥ (2 + 1) Im T((p). (8.6) 
1=0 
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These sums extend to infinity and would be of little practical use for the 
determination of the partial-wave amplitude 7,(p). Fortunately, however, 
according to (XVII.6.6) the higher partial-wave amplitudes for a particular 
value of the momentum p can be neglected, so that for a particular value of 
p the sum in (8.5) extends in fact only over a finite number of terms: 

2 


do 
aT Ton cos 0) = 2n x (21 + 1)P,(cos 0)T((p)| , (8.5’) 
and so does the sum in (8.6). How high a value of L one has to take for a 
particular value of p depends upon the problem under investigation and has 
to be determined empirically by the “principle of stability.” One does this 
by choosing some value Ly. Then one determines, by the procedure described 
below, To(p), ..., T,,(p); chooses Lo_ + 1; and determines from the same set 
of data a new Ss of amplitudes T,(p), ..., T,,(p), T,,+1(p). If T, and 6 
determined in this way differ significantly, i was chosen too small for that 
particular value of energy and one has to try a higher value. 

Using the relation for the Legendre polynomials”* 

+0 
Pcos 0)P,(cos 0) = YY <I0I'0|LO0>7P,(cos 6), (8.7) 
L=|l-l| 

where <Il'mm’|L, m + m’> are the SO(3) Clebsch-Gordan coefficients, one 
can write (8.5’) in the form 


do 
dcos 0 


2L 
= 2n Y Pi(cos 0)C fp). (8.8) 
j=o0 


The inversion of (8.8) is 


Die ee ayes do(p, 0) 
6 
4n | gee d cos 0 


For the sake of definiteness let us do this for the particular values L = 1 
and i, = 2, For L = I, (8.5') becomes 


Co P(cos 8). (8.8’) 


—- 50 =2ni|T,\? + 3g tt ee Pe ol ale (8.9,) 


For L = 2 it becomes 


do = 2 Z 2 
Fooe§ 7 2MEITo? + 31TP + 5172) 1Po(cos 8) 
+ 6Re(T*T, + T*T)) P,(cos 8) 
+ [10 Re (T#T) + 6|T, |? + 52| T|2]P,(cos 6) 


+ 18 Re (T*T,)P;(cos 0) + %2| T,|?P,(cos 6)}, (8.9,) 


?3 This is a special case of (VII.3.22b). 
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where the numerical factors are determined from the values of 2! + 1 and 
of the Clebsch-Gordan coefficients. Thus for example for L = 2, 


Cop) = |Tol? + 317, )? + 517 /?, 
C,(p) = 6 Re(T&T, + T#T;), ete. (8.10) 


Now if da/d cos 6 (p, cos 0) has been measured at a particular value of p, 
then one calculates the C,(p) from (8.8'). These “experimental” C,(p) one 
uses in equations like (8.10) to determine the “experimental” values of the 
partial-wave amplitudes. There are 2L + 1 equations (8.10). In addition 
one has the optical theorem 


Tt 


L 
a(p) = — (21 + 1) Im Tp). (8.6’) 
1=0 


leon 


So one has 2L + 2 nonlinear equations for the 2L + 2 real quantities 
Im T,(p), Re T,(p), ! = 0, 1,..., L, or the 2L + 2 quantities 5,(p) and n,(p) of 
(AVEE2N0): 

If the scattering at that particular energy is elastic, then one has only the 
L + 1 unknowns 6,(p), and one need not consider the C,(p) for j > L. There 
are then L + | equations for L + 1 unknowns in (8.10). [The optical theorem 
(8.6’) is then not independent of (8.10).] 

One has to go through the above procedure of determining the partial- 
wave amplitude for every value of p that is plotted in the Argand diagram. 
For small momenta, small values of L (e.g., L = 0) are sufficient, but as the 
momentum increases, higher values of L have to be chosen. As the equations 
are nonlinear, the solutions are not unique. Further, the experimental data 
have errors. The uncertainties that result can be overcome partially by 
considering p7,(p) at several values of p and requiring that it changes smoothly 
with momentum. Sometimes further relations, such as dispersion relations, 
are used to obtain this smoothness. In this way one obtains a picture of pT, as 
a function of p. 


XVIII.9 Comparison with the Observed Cross Section: the 
Effect of Background and Finite Energy Resolution 


Due to the effect of the background in the resonating partial cross section 
and the background of other nonresonating partial cross sections, the 
experimental cross section near resonance does not usually look as simple as 
in Figure 6.1a. We first consider in detail the interference between the reso- 
nance and the background. 

If we insert (8.3) into (XVI.1.38), we obtain for the resonant partial cross 
section 


ee 2 
o, = 4n(21 + 1) ie sin y, + e?"'TR} . (9.1) 
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We introduce the Fano shape parameter”* 
pi 
q=+coty,,  €=(ER-&£) e cot 6{*)(E). (9.2) 


q is called the profile index or shape profile parameter, and e¢ is defined as in 
(8.2). After some straightforward calculation one obtains for (9.1) 


ey q? + 2eq —1 

G;— 4x2) + | sin? y, + eeu (ome ae . (9.3) 

This is rewritten as 

2 
q* + 2eq — 1 
6, = of?) ae o(®) (oo), (9.4) 
where 
4n 1 

a) = 9.5 
o} ape ea (9.5) 


is the Breit-Wigner cross section of (6.27), and o}”) has been defined in 
analogy to (XVI.2.9) by 


4 
of) = a (21 + 1) sin? ,. (9.6) 


Inserting (9.5) and (9.6) with (9.2) back into (9.4), one can write the resonant 
partial cross section also as 


4n (q + ©)? 4n 5. qte 
= — 1 — 2l ae 1 } D) ’ 97 
wl p? es eee) fee IOS q+ ee) 
OT 
(q + ©)” 
— (be) 
: ee ee - 


The factor (q + ©)?/(e? + 1), which gives the shape of the resonance, is 
shown for various values of the shape profile parameter q in Figure 9.1. 
We see that the shape of the resonance in the resonant partial cross section 
depends considerably upon the value of the parameter q, which is connected 
with the background phase shift y,. (If the background phase shift depends 
upon the energy ¢, which is generally the case, then the shape becomes 
even more complicated.) For positive values of q we see that for increasing 
energies E, i.e., for increasing values of —e, the cross section increases above 
background, then falls sharply at the value of the resonance energy « = 0, 
is zero at € = —q, and increases thereafter. 

For negative values of q the cross section decreases with increasing 
energy, reaches zero at ¢ = q (i.e., before resonance), and then increases 
sharply. 


24 The original parameters introduced by U. Fano [Phys. Rev. 124, 1866 (1961)] were the 
negative of the parameters defined by (9.2). 
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Figure 9.1 The resonance profile function for different values of the profile index 
q. The arrow above E indicates the direction in which E increases. [From Massey 
& Burhop (1969), vol. 1, with permission. ] 


We are now in a position to explain the behavior of the experimental data 
depicted in some of the figures of Section XVHI.1. As we have so far only 
discussed the effect of the background, with the present results we can only 
interpret those experiments in which the energy distribution in the beam 
and the resolution of the detector can be ignored. These are the high-resolu- 
tion experiments whose results are depicted in Figure 1.3 and Figure 1.4d. 
These experiments measure the transmitted current, which is proportional 
to the incident current minus the total cross section. As the total cross section 
is the sum of the resonant /th partial cross section (9.8) plus the slowly 
varying background cross section from the other partial waves, a dip in the 
resonant partial cross section should show up as a peak in the transmitted 
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current, and a peak in the resonant partial cross section as a dip in the 
transmitted current. According to the results of Figure 9.1, the resonance in 
the Ith partial cross section should show up in the following way: 


For q > 0, as a dip around resonance followed by a peak right after 
resonance. 

For q < 0, as a peak right before resonance followed by a dip right after 
resonance. 


In the high-resolution experiment (very narrow spread of the electron 
beam) of Figure 1.4d, and in particular in the very high-resolution experi- 

ment of Figure 1.3, one can see that the situation described for q < 0 is 
observed. (If one knew the exact value for Eg, one could even infer the value 
of q). Thus our hypothesis of Section XVIIL.1, that in the collision process 
(1.4) a quasistationary state He~ is formed at around 19.31 eV, is justified. 
We shall see that the other experiments of Figure 1.4 and Figure 1.5 lead to 
the same conclusion. For these experiments the resolution of the apparatus 
is coarse, and the quantity measured is not the cross section at a particular 
energy value, but the total cross section in a whole energy interval. 

The total cross section (XVI.1.33) is written as 

6 = 3G, + > 6 = a; + G3, (9.9) 
Ul 

where o, is the resonant partial cross section given by (9.8) (describing the 
resonance and the background) and it is assumed that there is no resonance 
of any angular momentum /’ other than |. The background cross section 0, 
is then given by 


4n : 
6,=— 9 QI’ + 1)sin’ y,, (9.10) 
Presi 
where y,(E) are slowly varying background phase shifts. 
Inserting (9.10) ande(9.8) into (9.9), we obtain for the cross section 
(Cae 
=o," OR +o,, (9.11) 
where of°®) and o, vary slowly with energy. 
At energies far away from the resonance energy, ¢ > 20, we obtain from 
(9.11) 
o> 0) + o, = Onn (9.12) 


where we called o;,,,, the slowly varying non-resonant background. The 
maximum of the cross section (9.11) occurs ate = + 1/q,and at this maximum 
the cross section is 


Omax = ali Bi ;, RF q’o\”®) (9.13) 


Because of (9.12) the cross section (9.11) 1s conveniently written in the 
form 


2 3) = pA ae 
6 =o, + 6, + q + 2ge—-1 of?) — q+ 2ge — 1 of), (9.14) 


les : 
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(q? + 2ge— 1/1 +0?) 


Figure 9.2 The function that determines the form of the resonance 
effect relative to the smooth background. [Massey Burhop [1969], vol. 1, 
with permission | 


which shows that the resonance occurs above a gradually varying 
background o,,,, and has the characteristic shape determined by the factor 
(q? + 2qe — 1)/(1 + €?). Thus, if there is background from other partial 
waves, then in distinction to (9.8), the cross section no longer vanishes at 
€ = q but has its minimum value there, ¢,,,, = 0», which lies below the 
value for the background o,,,). The shape is now determined by the shape 
factor shown in Figure 9.2. 
Meg ae 


1 3 


then the resonance effect results in a reduction from the background on 
one side of the resonance that is greater than the increase above the back- 
ground on the other side. For |q| > 1 the reverse is true. 

So far in our discussions we have always assumed that the energy resolu- 
tion of the experimental apparatus is ideal. This means that the initially 
prepared beam can be considered as ideally monoenergetic, so that the 
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energy distribution function can be described by the 6-function according 
to (XIV.5.36): 


F(E — E4) = p(E)<E\||WaillE> = o(E — E,4), (9.15) 


where E, is the central energy value in the beam. Such an ideal resolution 
can be achieved in two ways: either by an ideal preparation apparatus (a 
high-resolution monochromator), or by an ideal detection apparatus (a 
high-resolution analyzer in the experiment of Figure I1.4.1). Both will lead 
to the same result because of enetgy conservation. As already mentioned in 
Chapter XIV, what constitutes ideal resolution depends upon the effects 
that are to be observed: The energy spread of the beam AE must be small 
compared to the width of the structure in the scattering amplitude in order 
that (9.15) may be used. 

The observed quantity is not the cross section o(E) at a definite value E, 
but the energy-averaged cross section 


Pe | dE F(E — E,o(E), (9.16) 


where F is determined by the energy resolution of the detection (or prepara- 
tion) apparatus, because every realistic detection apparatus detects particles 
not only with a definite energy E,, but with an energy in a finite interval 
AE around E,. If o(E) varies slowly as compared to F(E — E,), then 
ao *F =~ o(E,), which is equivalent to using (9.15). But if AE is not small 
compared with the width T’ of a resonance, and consequently o(E) given by 
(6.27) does not vary slowly, then one cannot approximate o * F by o(E,4). 
The observed quantity is then the quantity given in terms of the T-matrix 
elements by (XTV.5.40). o(E) is just an abbreviation for the expression under 
the integral besides F(E — E,) on the right-hand side of (XIV.5.40) and is not 
an observable quantity. Thus for a resonance of the /th partial wave with back- 
ground in the Ith partial wave and the other partial waves, o(E) = o(e) 
given by (9.14) is not the observable physical quantity if the width Tis of 
the same order as the energy resolution AE. The observable physical quantity 
iS 


COE) =¢rk = [uz o(E)F(E — E,) 


l| 


| dE (o'®(E) + o,(E))F(E — E4) 


+4 2ge — 1 
f | TOE OM EVE -E,) dE. (9.17) 


The precise value of this depends upon the energy distribution [the detector 
efficiency function F(E — E,)] for the particular experiment. For the sake 
of definiteness let us assume that the probability of measuring the energy E 


The Effect of Background and Finite Energy Resolution 445 


— 
ah 
AE 


F(E-E,) 


E4 


Figure 9.3 Assumed energy distribution for the beam with energy 
resolution AE > IT. 


is zero except for values in an interval of AE around E,. Then 


I AE AE 
— for E,-—<E<E,+-—, 9.18 
FE — E,y= 2 AE Aa a 2 So 


0 otherwise, 
depicted in Figure 9.3, will describe the apparatus resolution. Equation 
(9.18) is of course also an unphysical distribution like 6(E — E,), and a 
Gaussian around E , would often best reflect reality. However, for a beam with 


energy spread AE > I, the assumption (9.18) is of sufficient accuracy. With 
(9.18) we obtain for the observed cross section (9.17): 


EatAE E,t+AE 
aye [7 ae = a (o'®(E) + 0,(E)) dE 


AE E,—AE AE E4-AE 
eat pee te ee (be) : 9.19 
Piel ae eos * of(EME ged 


o*®)(E) and o,(E) are slowly varying functions of E. Therefore 
1 E4+AE 
Cou(E)> = zp [| CoE) + of OME 
AE E4—AE 


~ oP®(E,) + o,(E4) © o7(Ep) + 9,(Ep) 


is a slowly varying function of the average beam energy E, whose value is 
well approximated by the value it takes when the beam energy E, is equal 
the resonance energy Eg. 

The second integral on the right-hand side of (9.19) is given by 


Le. a 2 AE/T a 5) Pe ie 1 
(o(E,)> = —5 5 ome) | ie 


eat 2AE/V 1 + €* 


€4 = (Er - Ee 
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For E, far away from Eg, ie., for €,4 far away from zero, this will be close to 
zero, as can be seen from Figure 9.2, and the averaged cross section will be 
given there essentially by <o,,(E4)>. At the value of the average beam energy 
E, = Ep, the value of <o,(E,)> is given by 


PANN re a a! 
(ofEn)) = — 5 oP Ex) {hae 


2 AE/T 1" 
sO a: 9) = l| 
S of | ie eae 
et l+e 


Sp) Se 


ofP®n(q? — 1). 


In performing the integration we have taken the limits of the integral over 
the shape factor as +o, which for AE >T does not cause an appreciable 
error, as can be seen from Figure 9.2. Inserting this and 


o(°®(Ep) = oe (21 + 1) sin? y,(Ep) (9.20) 


into (9.19), we obtain for the value of the averaged cross section when the 
beam energy E, is set at the resonance energy Er 


<o(Er)> = <On(Er)> + (,(ER)> 


Se T(2l + 1) cos 2y(Ep). (9.21) 


2n? 
(WER)! 
Thus if a value of <o(Ep)> is observed in a cross-section measurement with 
an apparatus with poor energy resolution, then it may be above or below 
or even equal to the slowly varying background <o,,(E4)>, depending upon 
the value of the background phase shift in the resonant partial wave, 
y(ER) © y(E,). If cos 2y, > 0, then the resonance will show in the observed 
cross section <o(E,)> as a peak above background <o,,(E,4)> at the value 
E, = Ep. If cos 2y, < 0, then the resonance will show as a depression below 
background. For cos 2), ~ 0 the resonance will not show up at all in the 
observed cross section. 

With these results we can interpret the experimental data in Figures 1.4b, 
1.4c, and 1.5, which were taken in an experiment with energy spread of the 
electron beam, AE, larger than the resonance width. Figures 1.4b and 1.4c 
show an increase of the transmitted current and a decrease of the total elastic 
cross section around the value E, > Eg = 19.3 eV. This is what one expects 
from (9.21) if the background phase shift y, satisfies cos 2y(Ep) < 0, or 


4m < y(Ep) < am. (9.22a) 


The same effect is observed in the averaged differential cross section at all 
angles depicted in Figure 1.5. 
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From the data of the high-resolution experiments in Figure 1.3 and Figure 
1.4d, we concluded above that q < 0. Using (9.2), this means that 


30 < y(Ep) < 2. (9.22b) 


Thus combining (9.22a) and (9.22b), we conclude as a result of our qualitative 
considerations that 


30 < y(ER) < jn. (9.23) 


This shows that the background phase shift in the resonant partial wave for 
the process e + He > He > He + @ is not at all small. Taking just TP 
of (6.26) as the /th partial scattering amplitude and of*) of (6.27) as the cross 
section around resonance energy would certainly be unjustified, 

We have now explained the experimental data for the elastic scattering 
process e + He + e + He as a resonance of narrow width around 19.3 eV 
interfering with the background phase shift fulfilling (9.23). In order to 
obtain more information about this He~ resonance, we turn to the experiment 
of Figure 1.5. This is a low-resolution experiment with AE = (50-100) x 107? 
eV in which the intensity of the elastically scattered electrons is measured as a 
function of average beam energy at various values of the scattering angle, 
€@ = 10°, 40°, 60°, 90°, 110°. Thus Figure 1.5 gives us the averaged elastic 
differential cross section as function of scattering angle and energy: 


do do 
(2 (E,, 0) = fae 79 (E QFE — Ea), 


where F is the energy-resolution function of the apparatus in this experiment. 

With the experimental data of Figure 1.5, one can perform the phase-shift 
analysis as described in the second part of Section XVIII.8. Assuming that 
in the energy range considered the partial-wave amplitudes T, with | > 2 
can be ignored, one can use (8.9) for the determination of the phase shifts 
69, 0,;, and 0, at various values of the momentum. As the inelasticity co- 
efficients n, are zero in the present case, where only the elastic channel is open, 
the T(p) of (8.9) are connected with 6,(p) by (cf. XVI.2.8) 


pi <p) =e" sin d;. (9.24) 
The Legendre polynomials that occur in (8.9) are 
ere Ie P,=cos@, P,=4(3 cos’ 6 —1), 
P#=)(5:cos* 0 310810), P, = $35 cos* 0 — 30cos* 0 + 3). (9.25) 
One can first ignore the partial-wave amplitudes above T, and use (8.9,). 
At @ = 90° one has, according to (8.9,), only the contributions from 7). 


As according to Figure 1.5 there is still a resonance effect at 0 = 90°, one 
concludes that the resonance occurs in the S-wave: 


= 0, (9.26) 
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One can now apply (9.21) to the cross section do/dQ(E,, 0 = 90°). Instead 
of the apparatus resolution function (8.18), one may have to take a different 
one, depending upon the experiment.”° Then the expression on the right-hand 
side of (9.21) will be different, but the principle of determining the resonant 
phase shift 5) = 2/2 + yo is the same. The curve in Figure 1.5 for @ = 90° 


represents 
do ¥ 
Ze, ,9 = 90 )). 


It shows again the decline below the nonresonant differential cross section 
characteristic of a resonating partial wave with background phase shift 
given by (9.22a). From the values of 


do ‘ 
(3 (E4,9 = 90 ») 


at various energies E, around the resonance energy Eg, one can determine 
the background phase shift y) and the true resonance width I. The values 
obtained from this experiment are 


T ~ (15-20) x 1073 eV, 79 & 100°. (9.27) 


Yo is in agreement with the restrictions (9.23) obtained from the other experi- 
ments above. 

One may now use the data of Figure 1.5 at 6 = 60°. P,(cos 60°) is close 
to zero, and the influence of a possible T, term, which would be considerable 
for P,(cos 0) 4 0 according to (8.9,), is minimized. Fitting (8.9,) to the 
experimental data for @ = 60° with 6§ = 1/2 and y) = 100°, one obtains 


Si pespn eee (9.28) 


To check that the contribution from the higher partial waves is small, one 
can now fit the experimental data for the other scattering angles 6 with 
(8.9,). For example, with yo = 100° and y, = 25°, one obtains from the 
experimental curve for 8 = 10° 


0> = V2 ~d 4°, (9.29) 


This is indeed small, and justifies the working hypothesis made at the begin- 
ning of our analysis that T, and all higher partial waves can be ignored. 
The He in the collision process 


e+ He>He ~—He+e (1.4) 


25 Most frequently the apparatus resolution function is a Gaussian (6.39). The observed 
cross section is then according to (9.17) the convolution of a Gaussian and a Lorentzian func- 
tion, which is known as the Voigt integral or Voigt profile. The Voigt integral cannot be found 
analytically, and has been computed and tabulated. The Lorentzian width T and the Gaussian 
width AE can be obtained from a fit of the experimental data with the Voigt profile. See, e.g., 
H. G. Kuhn, Atomic Spectra, Academic Press, 1969, Section VIID. 
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is in the ground state 'S, with | = 0, s = 0. The e in this process has s = 4, 
and as we know from (9.26), | = 0. Consequently, the quasistationary state 
He™ has! = Oand s = },ie., it isa *S,)2 State. It is very reminiscent of the 
*S1/2 ground state in the lithium spectrum (cf. Figure VII.2.1). The only 
difference is that the charge of the nucleus in lithium is +3, whereas that in 
He’ is +2, and the He™ state has a width different from zero and given by 
(927), 


Problems 


1. (a) Obtain the S-wave phase shift and the | = 0 S-matrix element S,(p) for 
the square-well potential. 

(b) Find the energy values of the bound states of angular momentum 0 as 
poles of So(p). Discuss the relations for the mass, the depth Vj, and the 
radius R of the well that have to be fulfilled in order for there to be (i) no 
bound state, (ii) one bound state, (iii) two bound states. 

(c) What are the virtual states of angular momentum zero? Under which 
conditions do there exist no virtual states for ] = 0? 

(d) Consider a very deep well fulfilling 2mV,R? > 1. Show that there are 
regularly spaced s-wave resonances. Determine energy and width of these 
resonances. 

(e) Obtain the scattering matrix element So(p) at energies close to a resonance 
energy, and obtain the background phase shifts. Calculate the elastic 
scattering cross section (/ = 0), and show that it can be written as the sum 
of three terms: one from background scattering, one from resonance 
scattering, and one from the interference between resonance and back- 
ground. Give a graphical representation of the elastic scattering cross 
section as a function of energy. 

2. Calculate the time delay caused by a narrow resonance of angular momentum 
I, and with resonance parameters (Ez, I) of a beam with an energy distribution given by 


’ 1 
> <1, 1E|W*|Ell;> = W(E) = —— F(E — Eg) for every l, 
Is 


P(E) 
where 
1 AE AE 
aS ibe =P Sb, + —, 
F(E — Ep) = (AE 2 Z 


0 otherwise, 


with AE > IT. Give the result in the approximation of constant background phase 
shifts. 

3. Calculate the time delay of a beam with a Lorentzian energy distribution (6.41) 
caused by a resonance with resonance parameters (Ey = E,, 1 = AE). 

4. Taking for the scattering amplitude an expression of the form 


Pa r2 
T, = — 
p(E — Eo) — i(T/2) 
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will also lead to a Breit- Wigner cross section (6.27). Why can T, not describe the partial- 
wave amplitude for resonance scattering? 


5. (a) Describe the behavior of the resonance phase shift for a “dipole” reso- 
nance (6.44). 
(b) Discuss the cross section near resonance energy as a function of energy 
for a dipole resonance, taking the effect of a slowly varying background 
y(E) into account. Draw graphs in analogy to Figure 6.1. 


6. Compare the case of a dipole resonance with the case of two resonances close 
to each other that are described by a scattering amplitude 


loki dy, Ay ei2theE)— EN 
T(p) = — 27 ®) ( . + 
P Exg—E—i(Tp/2) Ex—E-i(ly/2) 
where jz. g,)’x, Vy are real constants; E; > [; > y; (i = R, X)and|Eg — E,y|are of the 
same orders as 'z and 'y; and 8(E), »#(E) are slowly varying functions of E. Consider 
first the special case of zero background shifts yf = y* = 0. 


7. (a) Fora resonance, p7,(E) moves counterclockwise in the Argand diagram. 
Show that this is related to causality. 
(b) Discuss when p7,(E) can move clockwise in the Argand diagram. 


8. Consider the quasistationary state with energy E, and width I for which the 
parameter r in 


T/2 
SUE —Texo (i arctan E, : 5) 


has the value r = 3. Obtain the slope of the resonant cross section for such a “tripole™ 
resonance. (Ignore the background.) 


9. Explain, using the shape of the function o,(E) in Figure 6.1 for a background 
phase shift y, = 7/4, why the cross section <o(E4)> observed by an apparatus with 
poor energy resolution AE > I need not show a structure at the energy value E, equal 
to the resonance energy Ep. 


10. Show that the effect of a bound state with low binding energies in elastic 
scattering is the same as that of a virtual-state pole close to the real axis. 


CHAPTER XIX 


Time Reversal 


For the discussion of multichannel resonances in Chapter XX, we require 
the properties of the S-matrix that follow from time-reversal invariance. 
In Section XIX.1 space-inversion invariance is applied to determine some 
properties of the S-matrix. Time reversal is introduced in Section XIX.2. 
The main purpose of Section XIX.1 is to introduce Section XIX.3, where 
the consequences of time-reversal invariance for the properties of the S- 
matrix are examined. 


XIX.1 Space-Inversion Invariance and the Properties of the 
S-Matrix 


In Section V.4 we introduced the observable parity, represented in the space 
of physical states by the unitary operator U>, fulfilling the relations (V.4.1) - 
(V.4.3) with the position, momentum, and angular-momentum observables. 
We mentioned in Section V.4 that not all energy operators commute with 
the parity operator — in particular, the interaction Hamiltonian that causes 
the weak transitions, and consequently the transition operator for the weak 
interaction, do not commute with U,. The weak interaction violates parity 
invariance. However, many scattering and decaying systems are parity- 
invariant, i.e., in addition to 


bho U,| 0 Gal) 


they also fulfill 
ic; 0; or [T, Up] = 0, (1.2) 
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or in terms of the S-operator 
S = USSU, (1.3) 


Parity invariance leads to some constraints on the T-matrix and S-matrix 
elements. For example, it follows from (V.4.2) and (V.4.5) that 


Up|pn> = 2,|—Pn"), (1.4) 


where, 15 + lor —f, and for the angular-momentum eigenvectors (XVI.1.3), 
in analogy to (V.4.39), 


Up |Ellsn> = (—1)'n,| Ells"). (1.5) 


The additional, internal quantum numbers y usually do not change under 
Up; thus 7” = n. The internal parity , may be an element of the set of ad- 
ditional quantum numbers 4. 

From (1.4) it follows for the S- and T-matrix elements in the momentum 
basis, using (1.3) or (1.2), that 


<p’'n'|S|pn> = <p'n’| US SU, |pn> 


= (—p'n'|S| —pn>2, 7, (1.6) 
or 


<p'n'|T |pn> = <—p'n'|T|—pnt, 7,- (1.7) 


From (1.5) it follows for the T-matrix element in the angular-momentum 
basis (assuming T is spherically symmetric) that 


CElIyn'|T|Ellyn> = (EIT 3n'|T| Ellyn >(— 1 ty My (1.8) 


The conditions (1.7) and (1.8) constitute restrictions on the properties of the 
T-matrix elements. 

We shall also restri¢t ourselves to the case discussed in Chapter XVI, 
namely, where the internal quantum numbers y do not contain internal 
angular momenta. Then, using (XVI.1.18), one obtains from (1.8) 


CEIT34'|T | Ellsq> = ty CEU n'|T| Els now Oras (1.10) 


or for the partial-wave amplitude defined by (XVI.1.34), (XVI1.23), 
(XVI.1.18), 


TCE) = ae (1.11) 


This shows that in the case of elastic scattering of spin-zero particles, parity 
invariance does not impose any additional restrictions that are not already 
consequences of rotational invariance. For yn # n', (1.11) states that the 
partial-wave reaction amplitude will be different from zero only if the in- 
ternal parity of the initial state is equal to the internal parity of the final 
state x, = 7, or if the internal parity is conserved. Parity invariance (1.2) 
leads to parity conversation (1.7), (1.8), (1.11), in the same way as rotational 
invariance (XVI.1.1) led to angular-momentum conservation (XVI.1.18). 
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XIX.2 Time Reversal 


The main purpose of the preceding discussion on parity invariance is to 
Serve as an introduction to another kind of invariance, which superficially 
appears to be very similar to parity invariance but is of a completely different 
kind: the invariance under time reversal. Naively, time reversal is the re- 
placement of t by t® = —t; however, this is not a physical operation like 
taking the mirror image x; > x = —x,, as we cannot turn time backwards. 
But we can compare the state of a physical system having a position x; and a 
velocity v; with the state of the physical system having the same position x; 
and the velocity —v;. If in a scattering process all positions and velocities 
undergo this transition at all times: 


FT ix74+x,7 :v>-v, or7 :x>x,7:p- —p, (2.1) 


then one obtains a process which looks like the original process going 
backward in time. 

The time-reversal operation 7 is the velocity reversal (2.1). For a quantum 
physical system this transformation 7 is represented by an operator in the 
space of physical states. We call this operator the time-reversal operator 
A,. If Ay represents a usual observable, like those we have considered so 
far, then A; should be a linear operator. We shall see instantly that A; 
cannot be a linear operator. 

In analogy to (V.4.1) and (V.4.2) for the parity operator Up, one would 
translate (2.1) for the quantum physical observables Q; and P; into 


A7Q;Ar* = Qi, (2.2) 
AS = —P;. (2.3) 


We shall justify this below. 
For the orbital-angular-momentum operator L; = ¢€;;,Q;P, and thence 
for any angular-momentum operator J,, it follows from (2.2) and (2.3) that 


A = —J;. (2.4) 


Let us now consider the commutation relation between the momentum 
operator P; and the angular-momentum operator J;: 


[Ji, Pa) = tena Pr, (2,9) 
and insert (2.3) and (2.4) into (2.5): 
ArLJi, P,JAr’ = — ie Ar P,Az’. (2.6) 


If A; were an ordinary linear operator, then we would obtain by multiplying 
(2.6) with A;! from the left and A; from the right 
[Ji. Pr] = —t€ns Pi. 


This would contradict (2.5). Thus if A; is to reverse the direction of the 
momenta (2.6), keeping the positions unchanged, then A; cannot be a linear 


operator. 
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[A one-to-one mapping A of a linear space # over the complex 
numbers C is called a semilinear operator if 


Aw + )=Aw+ Ad forall y, de #, Ag — A — 0, 
(27a) 
and 
Aad = a’ Aga e Ca’ = a, or a’ = a*. (2.7b) 


A semilinear operator is called antilinear if «’ = «* (complex con- 
jugate of «), ie., if : 


Aad = a* Ad. (2.7c) 


A semilinear operator is called linear if «’ = «. Thus in distinction 
to a linear operator, already defined by (1.3.1), an antilinear operator 
also transforms the complex numbers into their complex con- 
jugates: 

Ai = —iA. QO) 


In a scalar-product space # one can uniquely define for every 
antilinear operator A (defined everywhere in # or in a dense 
subspace thereof) an antiadjoint operator A‘ (often simply called 
the adjoint operator) by 


(4f,9)=(4'¢, =(f,4'9)*, f.ge#. (2.8) 


The following relations hold for a linear operator L and antilinear 
operators A,, A,: 


(A,A,)' = AJA}, (AL)P=LIAt, (A, LA,)' = ASLTAY. (2.9) 
An antilinear operator is called antiunitary if it preserves the norm, 
Le. 1f 

(Ag, Af) = (A‘Af, g) = (f,.9), ot A= At 10) 


The product of two antiunitary operators is a unitary operator. 
For antilinear operators one has, in analogy to the case for 
linear operators, 


Tr(AW) = Tr(WA),  Tr(QAWA') = Tr(AtOAW), (2.11) 


where W and Q are Hermitian operators. It is always assumed that 
the operators are such that all traces that occur in the proof are 
well defined. | 


If we choose for the representative of the velocity reversal or time reversal, 
A,, not a unitary operator as we did for the space inversion in Section V.4, 
but an antiunitary operator, then (2.6) will not lead to a contradiction, but — 
using (2.7c)—back to (2.5). 

We therefore make the following statement: The time-reversed or velocity- 
reversed state W"(t) of the state W(t) is obtained from W(t) by 


W(t) = A,W(t)Ab, (2.12) 
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where the time-reversal operator A; is an antiunitary operator 
Al Ayal (2.13) 


that fulfills the defining relations (2.2)-(2.4).' 
In addition A, fulfills 


A2=el, wheree = +1 or —1. (2.14) 


Furthermore we require that the energy and transition operators commute 
with the time-reversal operator A, :? 


A;K — KA, = 0, A;,H a HA; = 0, Ar;T— TA; = 0, (2a5) 
and that the S-operator fulfill 
ASA: — S|, (2:15) 


(2.15) and (2.15’) are the statement of time reversal invariance. 

The justification of this statement consists of two parts. The first and more 
difficult part is to show that the operation of time reversal is represented 
by a semiunitary operator [1.e., an operator fulfilling (2.7a), (2.7b), (2.13)]. 
In our heuristic arguments above, we had simply assumed that—in resem- 
blance to the basic assumption [—time reversal is represented by something 
like a linear operator. The justification of this is, essentially, the celebrated 
Wigner theorem and applies to a much wider class of transformations than 
that of time reversal. This part is briefly discussed in the appendix to this 
section. 

The second part in the justification of the above statement is to establish 
the particular properties of the time reversal operator expressed by the 
relations (2.2), (2.3), (2.4), (2.15). 

From the physical interpretation of the time-reversal or velocity reversal 
operation, the following requirement results: If the expectation value of the 
momentum in the state W(t) is f, then the expectation value of the momen- 
tum in the state W7(t) should be — fp. This means 


Tr(P, W(t)) = —Tr(P;, W(t) = —Tr(P;A;W()A}) = —Tr( A} P; Ar W(0). 
(2.16) 


As this is to be fulfilled for any state W(t) and its corresponding time-reversal 
state W7(t), it follows that 


P; = — A}P; Ay, 


from which (2.3) is obtained. Equation (2.2) follows in the same way. Equa- 
tion (2.4) follows from (2.3) and (2.2), as mentioned above. It has also already 
been mentioned that from (2.6) the antilinearity of A; follows. 


1 If there exist additional independent observables that cannot be expressed in terms of 
P,, Q;, and J;, then their relation to Ay also has to be specified. 

2 So far, this requirement has been found to hold except for decaying K° mesons, and there 
may be a superweak part of the Hamiltonian that does not fulfill (2.15). 
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Executing the operation of time reversal two times on all states, 
WIT(t) = ApW(t)A} = Ap Ay WAS AS, 217) 
should lead to the original state. From this requirement it follows that 
A;A, = el withe. —. (2.18) 


But as A, A2 = ApApAz = AZAz and consequently A;e = €Ar, it follows 
from the antilinearity that € must be real. From this (2.14) follows. 

Whether one has to choose for the time-reversal operator an A, fulfilling 
(2.14) with e = +1 or with e = —1 has—at least at the present time—to be 
decided by experience. The following choice has proved to be without con- 
tradiction to experimental observations: 


(=) (2.19) 


where s is the spin of the physical system.** 

The requirement (2.15) is empirical in nature: No interaction has been 
found (except for the abovementioned K° system, which we shall ignore 
here) that violates time reversal invariance as stated by (2.15). Also, if the 
Heisenberg equation of motion (XII.1.24) with (XII.1.26) is to be fulfilled 
by the position operator, then it follows from (2.2) and (2.3) and the anti- 
linearity of A; that H must commute with A7. 

Equation (2.15’) is a consequence of (2.15) and the antiunitarity of Ay. 

Therewith we have justified the above statement. 

The state W and its A,-transformed state W’ may be considered as dif- 
ferent states of the same physical system. It is, however, more practical to 
restrict the scope of the physical system and to consider all the A7-trans- 
formed states of all the states of a physical system as belonging to a new 
system, the A;-transformed system of the original system. To obtain an 
idea about the property of the A;-transformed system, let us consider a 
particular state: 


W(t) = A, W(t)Ab. 


From the time-development axiom V—in particular, from (XII.1.15) and 
(XII.1.16)—it follows that 


wit) _ Asc Woe Ay = et AW Aree 
= eM ony Teton, (2.20) 


Thus the A,;-transformed system develops forward in time in the same way 
as the original system would develop backward: W(—1) = e ?# Woel OF, 
If an original scattering system develops from a particular in-state W'™ into 
a particular out-state W°"'", then the time-reversed system develops from an 
in-state (W")" agreeing with the out-state of the original system (W7)'" = 
W" into the out-state (W7)™ = W". 


21 BP. Wigner, in F. Giirsey (ed): Group Theoretical Concepts and Methods in Elementary 
Particle Physics. Gordon and Breach, New York, 1964, p. 37. 
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Appendix to Section XIX.2 


In this Appendix we give a brief description of the conditions and arguments 
that lead to the statement that the transformation of a physical system is 
described by a semiunitary operator. We choose here the transformation 
of time reversal. 


Let W(t) denote the state that is obtained from the state W(t) by the 
operation of time reversal 7: 


FT: W(t) > W(t). 
It is natural to require that: 


1. If two states have no property in common, then the 7-transform of 
these two states should also have no property in common. 

2. If one state has a more specific property than a second state, then its 
JF -transform should also have a more specific property than the 
7 -transform of the second state. 


Let us restrict ourselves to states that are described by projection op- 
erators A, and A,. Then requirement 1 means: 


A, A, = 0 > ee = 0, 
Or 
A, HLA,H >A HLTH 


(orthogonal properties go into orthogonal properties). Requirement 2 
means: 


A,A, = A, => ALAS = ee 


A, is the state with the more specific property than A,. (As an example, A, 

may be the state that has angular momentum j = 2, and A, the state that 

has angular momentum = 2 and angular-momentum component/, = +2.) 
It is further natural to require that: 


3. Time reversal is a one-to-one mapping in the set of states. 


From these three requirements it follows by a general mathematical 
theorem? that time reversal.7 is described by a semilinear operator A,: 


TFT: Wt) > W'(t) = A, W(t)At. 
It is further natural to require that: 


4. Overall probability should be preserved by the operation of time 
reversal. This means that for all W and their corresponding W* 


Tr W =Tr W'. 
From this follows (2.13). 


3 E. Artin, Geometric Algebra, Academic Press, New York, 1951, p. 88. 
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Thus, the operation of time reversal is represented by a semiunitary 
(unitary or antiunitary) operator. 

So far no specific property of the time-reversal operation has been used, 
and the above arguments apply to any transformation of the physical states 
with the properties 1, 2, 3, and 4. Thus every such transformation is repre- 
sented by a semiunitary operator which is in general determined up to 
a phase factor. The celebrated Wigner theorem,* which states that a sym- 
metry operation of a quantum physical system is induced by a unitary or 
antiunitary operator, is contained in the above statement. 


XIX.3 Time-Reversal Invariance and the Properties of the 
S-Matrix 


Before we can apply the assumption of time-reversal invariance to find the 
properties of the S-matrix and T-matrix elements, we have to determine 
the action of A; on the basis vectors. This is done in the same way as it was 
for the parity operator Up in Section V.4. For the momentum eigenvector 
|p7> it follows from (2.14) that 


P(Ar|pq>) = —PArlpn>). (3.1) 


Consequently, (A;|py>) is also an eigenvector of P; but with eigenvalue 
—p;. If there are no additional eigenvectors and p itself labels the basis 
system completely, then it follows from (3.1) that 


Ar|p> = «(p)| —p>. 


where « is a phase factor, which, because of 


A;|p> ="el|p> = «(p)Ar| —p> = =(p)a(—p)|p>. 
fulfills 


ao? =¢€ (= +1 for the spinless case) (3.2) 


and can be shown to be independent of p. 


For the angular-momentum eigenvectors | Ell, > it follows in the same way. 
from (2.4) and (2.14), that 


Ay Ell, — ol Ei — 1, >. (3.3) 
If there are additional quantum numbers y (internal quantum numbers, 
channel labels), then the action of A; on the basis vectors |...4> depends 


upon the relation between A, and n°". Let us assume that n°? does not con- 


+ Proofs of the Wigner theorem can be found in Ludwig, (1954, p. 101); U. Uhlhorn, Ark. 
Fys. 23, 307 (1963); L. O’Raifeartaigh, G. Rasche, Ann. of Phys. 2h, 155 (1963); V. Bargmann, 
Ann. of Math. 59, 1 (1954). See also J. M. Jauch, in Group Theory and Its Applications (Ed. E. M. 
Loebl), Academic Press, New York, 1968, p. 131, where further references are given. 
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tain any internal angular momenta. Let us further assume that all 4°” com- 
mute with A,: 


Aran — (3.4) 


This will be the case if °° stands for the internal-energy operator of the 
target, for instance. Then 


Ar|pn> = a(n)|—pn> and A;|Ellsn> = a(y)|El—13n>, (3.5) 


where a = a(7) may depend upon y and perhaps also upon 1. 

Let us now consider the S-matrix element in the angular-momentum 
basis (XVI.1.42) and the reduced S-matrix elements <y||S,(E)\\7’> defined 
by (XVI.1.46) and (XVI.1.44). One can either use (2.14) for the T-operator 
together with (3.5) and the definition of the S-matrix given by (XVI.1.42), 
or one can use (2.15) for the S-operator directly together with (3.15), to cal- 
culate, using (2.8) and (3.5), 


<Ell3n| APSAz|EUI39'> = (Az| Ells), SAp|El'T31'>)* 
= a(n a*(n')C EI — Iyn|S|ET — Iyn'>* 
= a(n)a*(n') ET; n'|S| El — 131). (3.6) 


On the other hand, the left-hand side according to (2.15) is equal to 


<Ell;n|S*|EVI;n'> = (S| Ell,n>, |E'U13n’>) 
= (E'l'l,y'|S|Ellyn)*. (3.7) 


Thus 
CETI39'|S|Ellgn>* = CET — lyn’ |S|El — lsnda(ya*(y') (3.8) 
or also, equating the complex conjugate of (3.7) and (3.8), 
(EMTs 9'|S|Ell3n> = (El — In |S|EV — Iyn’patno*(n’‘) (3.9) 
In order to determine the phase factors, we calculate 
<Ell,n| Af Ar EVI3n'> = ay)o*(y’)C Ells n| EN3n'>*. (3.10) 
On the other hand, because of (2.12), 
CEllyn| A} As|ETIyn'> = (Ells n|EV3n'> = po (CENCE — E011, 9 
(3.11) 


if the generalized eigenvectors are “normalized” in the usual way with real 
p(E). Consequently, comparison of (3.10) and (3.11) shows that we require 


a(n )a*(n’) = 1. (3.12) 


Equations (3.8) and (3.9) with (3.12) are called the reciprocity theorem? (for 
the spinless case). This relation is quite distinct from the corresponding 


5 The reciprocity theorem is also called the principle of detailed balance or the principle of 
microreversability. 
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relations—e.g., (1.7) or (1.8)—following from parity invariance, as it relates 
two different processes with each other. The left-hand side of (3.9) describes 
the process 7 — n’, whereas the right-hand side describes the process 7’ > 1. 
The reciprocity theorem then states that these two processes have the same 
transition probability. The measurement of these transition probabilities 
provides a possibility of checking whether time-reversal invariance is ful- 
filled. 

We now want to determine the consequences of time-reversal invariance 
upon the /th partial reduced -S-matrix element <y'\|S,(E)\/n> defined by 
(XVI.1.45) and (XVI.1.43). Inserting these into (3.8), we obtain 


<n'|S(E)Ind>* = <n’ |S) IInd; (3.13) 


inserting them into (3.9), 


<n’ ||IS(EDin> = <n|SKED n> (3.14) 


Thus the /th partial reduced S-matrix element <n'\|S,(E)||y7> is not only 
unitary [cf. XVI.2.4)] but also real, i.e., it is a symmetric matrix. We shall 
make use of this fact in the following section. 

For an experimental check of the reciprocity theorem we obtain the 
consequences of (3.14) for the partial cross section. Inserting (3.14) into 
(XVI.1.48), we obtain for the partial-wave amplitudes (XVI.1.34) the fol- 
lowing relation: 


P(E)pTi(E) = / PE)p' Tt, {E’). (3.15) 


And inserting this into (XVI.1.37), one obtains the following relation between 
the partial cross sections of “" and o/~" describing the processes n > 1’ 
and n’ > n, respectively: 


ee (3.16) 


An analogous relation holds for the total and differential cross sections. 
Equation (3.16) has been derived for the case of spinless particles. If pro- 
jectile and target for the system n have the spins s and s;, and for the system 
n’ the spins s’ and s7, then (3.16) is replaced by 


or res p? (2s + 1)Qsz + 1) 3.16) 
p’? (2s' + 1)(2s; + 1) < 


For a check of the principle of detailed balance (3.16), we consider the 
experimental data of Figure 3.1. 

This figure depicts the counting rate (which is proportional to the cross 
section) as a function of the energy for various processes. Parts B and C 
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are for the processes 
Al?7 + prorat Me?4, 
Mg?* +a>p-+ Al?’, 


which are related by the principle of detailed balance, and (3.16) should 
apply, with 7 characterizing the Al?7-proton system and n’ characterizing 
the Mg?*-a system. The energy scale has been appropriately adjusted, and 
the curves should agree with each other (except for a factor). The data were 
taken from two different experiments with different resolution; still, the 
agreement is convincing. The various bumps correspond to resonances of 
the Si28 system, and we shall discuss this, as well as the comparison with the 
curves in parts A and D, in the following chapter. 


Problems 


1. Show that the product of two anti-linear operators is a linear operator and that 
the product of an antilinear operator and a linear operator is anti-linear. 

2. Show that as a consequence of the antiunitarity the time reversal operator must 
fulfill A? = ¢«, wheree = +1 ore = —1. 

3. Show that from the definition of the Meller wave operators (XV.3.7+) it 
follows that Q* = A1Q* 4, if H and K commute with 4,. Use this to show that (2.15) 
is a consequence of (2.14) if the S-operator is defined in terms of the energy operator. 

4. Prove that the time-reversal operator transforms the exact energy eigenvectors 
|p, ‘> into the exact energy eigenvectors | — py” >. (The label + or — refers to the 
definition according to the Lippman-—Schwinger equation.) 

5. Show that the time-reversal operator transforms an out-state into an in-state 
and vice versa. 


6. Consider a multichannel system of spinless particles. Show that the trans- 
formation property under Up and A; implies that the S-matrix in the momentum basis 
is symmetric. 

7. Show that from the Heisenberg equation of motion (XII.1.24) for the position 
operator, A = Q,, with (XIL1.26) and the antilinearity of A; follows that H commutes 
with A;. 


CHAPTER XX 


Resonances in Multichannel 
Systems 


In the present chapter we discuss resonances in scattering processes from an 
initial state into several possible final states, where the internal quantum 
numbers of the resonances are, in general, different from the quantum numbers 
of the initial state and the final states. In Section XX.2 we first discuss, in detail, 
the case of a single multichannel resonance and then the case of a double 
multichannel resonance (which occurs if there are two resonances with 
different internal quantum numbers in the same partial wave, with both the 
resonances coupling to initial and final states). In Section XX.3 the Argand 
diagrams for inelastic resonances are described. 


XX.1 Introduction 


In the discussion in Chapter X VIII we assumed that there is only one channel 
open in the scattering experiment and that it is this particular channel to 
which the resonance phenomenon is restricted. Then there is only one 
reduced S-matrix element <n||S,(E)\/y> that is different from zero, and it is 
this reduced matrix element that resonates. This is the case of elastic scattering. 


a+T-R-a+T. Cit) 


yn are the quantum numbers of the initial state (a, T), of the final state (a, 7), 


and of the resonance R. 
This is, of course, a very special situation, which in general will not be 
fulfilled. When the energy is high enough, so that several final channels are 
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open, a resonance can occur not only in the elastic scattering process 
described by S(E) = <n4\|S(E)\|\n4> but also in the inelastic processes 
described by <n||S,(E)\|n4> with n # 4. In order that one particular reso- 
nance shows up in several channels this resonance state must have nonzero 
transition matrix elements to all initial and final states with quantum 
numbers n, 7’, 7". If one also wants to assign internal quantum numbers, say 
kK, to the resonance state, then these must be different than the quantum 
numbers 7; the corresponding observables «°° and 4°? cannot commute. 

It may be a matter of taste and convenience whether one characterizes 
these resonance states by internal quantum numbers Kk = (kK; K2--- K,) and 
considers the resonance energy as a derived quantity Er = E(Kg), Kr = 
(Kp, Kp2***Kr,) determined by the internal quantum numbers, or whether 
one characterizes the resonance state directly by the resonance energy Er. 
The former view is prevailing in particle physics, the latter in nuclear physics. 
We shall follow here the former view, assuming that the resonance state is 
characterized in addition to the angular momentum by a set of “internal” 
quantum numbers x, because this allows a very simple derivation of multi- 
channel resonance formulas from the results of Chapter XVIII. 


XX.2 Single and Double Resonances 


The reduced matrix elements of the S-matrix <y||S,(E)||7'> form a unitary 
matrix, according to (XVI.2.3). According to a well-known theorem of 
linear algebra,’ for any unitary matrix there exists another unitary matrix 
that diagonalizes it, and the nonzero elements of the diagonal matrix (the 
eigenvalues) are of absolute value one. Thus for any value of E, there exists 


a unitary matrix ¢x||y> (with an adjoint <n||K> = <Klln)), 


KKK = See = LCM <ai>, Syn = Kad = YD Called CeIn 


(2.1) 
such that 


Sy: 770) = YY Cacllen> Cll SE) in") <a'IIk’). (2.2) 


O*(E) are called the eigen-phase-shifts or eigenphases. The matrix <x||n> 
may be different for different values of E and, therefore, in general, a function 
of the scattering energy. From the time-reversal invariance it follows that 
the reduced S-matrix is real [Equation (XIX.3.14)]. From the general 
theorems! of linear algebra it follows that for a real unitary matrix there 
exists an orthogonal matrix that diagonalizes it. Thus, the ¢x||7> fulfill not 
only (2.1) but also 


<Klln> = Callk> = C«lln*. (2.1a) 


' See, e.g., A. Lichnerowicz (1967) or V. I. Smirnov (1964, Vol. III, Part 1, Section 42). 
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Inverting (2.2) with the help of (2.1), we obtain 
<n|SKE)\in’> = Ye???) n |] k> <xlln’d. (2.3) 


To understand the physics behind this mathematical transformation of 
the reduced S$ matrix into diagonal form, let us consider the basis vectors 
that were used for the space of the initial states (a, T): 


| Ell,n> = |Ells> ® |). 


The orthogonal matrix <y||K> transforms this basis system into a new basis 
system 


|Ell,x> = ) | Ellsn><nllx> (2.4) 


with the quantum numbers Ell, unaffected but with new internal quantum 
numbers x. The observables whose eigenvalues are x do not commute with 
the observables whose eigenvalues are n: 


ieee | oO: (2:5) 


The quantum numbers y were chosen so that the initial state (a, T) is charac- 
terized by a definite value of 4: n = n4 = n(a, T). Now it may be that the 
quasistationary state R that is formed in the processa + T>R->a+T 
does not have a definite value of the quantum numbers y but a definite value 
of the quantum numbers k, say kK = Kg. Then we can repeat the same con- 
siderations as in Sections XVIIJ.2 and XVIIH.6—this time not for the in- 
ternal quantum numbers yn, but for the internal quantum numbers x, in 
particular for that value xp of the quantum numbers x that constitutes the 
internal quantum numbers of R. The time delay t{}®) and phase shift 4)” are 
then characterized by this quantum number in the same way as in Chapter 
XVIII they are characterized by the quantum numbers yn, of the initial (and 
final) projectile-target system (a, T). Thus it is not for the quantum numbers 
n4 Of the initial state (a, T) that (XVIII.6.23) holds, but for the quantum 
numbers Kp of the resonance state R, which now has quantum numbers that 
are incompatible with the quantum numbers n,, i.e., they fulfill (2.5). Thus 
instead of S, = <n4l/S(E)|In4>, we now write Equation (XVIII.6.23) for 


CKrl|S) (E)||KRD: 


(Krll/S(E)\\Kz> = ei2Ficr)pi2vlkr) — gi2di(rr) 


iT'p i2yi(kr) 
_ i21(eR), 2.6 
(: ssh eh ale 2) 


OF (kp) here denotes the resonant phase shift, and (Kg) the background 
phase shift, so that the phase shift is again written 
OK) = Of(KR) + (Kr). 


The reduced S-matrix elements <n||S,(E)\|n’> between the (reduced) basis 
vectors ||7> have then to be expressed in terms of the reduced S-matrix 
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elements <x||S,(E)\|K’> between the (reduced) basis vectors ||K>, using the 
transformation 


In> = YlK><xlln>, (2.4') 


which leads to (2.3) with 
Be 22) = Cae Silk’. 


The matrix element <x||y> is the number that characterizes the strength 
of the coupling between the states with the internal quantum numbers 
and x. As mentioned above, they depend in general upon the energy E. 

It may of course be that a quasistationary state is formed not only for 
one value Kg of the quantum numbers x, but also for other values kK = 
Ky,,Ky,,---- This may even happen at the same value / of the angular mo- 
mentum and even at close energies Ez and Ex,. We assume for the sake of 
definiteness that there are two values kg and Ky for which quasistationary 
states are formed with angular momentum /, and their energies and widths 
are (EgI'pg) and (EyI’y), respectively. Then in addition to (2.6) one has also 


: iT . 
: es 2a (een): = 5 ieee 07 
<Ky||S(E)||Kx> é ( + Ee = E = =a) e ( ) 
We also assume that all the other S-matrix elements are nonresonant, i.e., 
CISKE\ (Kk = e'2°") = e247)" “for k eiten hy, (2.8) 


where (x) are the background phase shifts, which are functions of energy 
E. One may be tempted to assume that the background phase shifts in the 
resonant and non-resonant channels are slowly varying functions of the 
energy. This is, however, in general not the case’* and we shall not make 
use of such an assumption here. 

We shall call the above-described object a double resonance or double 
multichannel resonance of quantum numbers (kp, Ky). Thus the formation 
experiment in the scattering process 


a+ Toa+T 
can go two ways: 


Rey -n. KR ’ ' 
(a ate Dein q-n. NA Pe oe y i +a with q-n. n° 


with g.n. Kx 


‘a These rapid energy variations follow from Wigner’s eigenphase repulsion theorem: The 
eigenphases 0,(k) do not cross (modulo z). As a consequence, near the resonance energy in 
the channel kp the eigenphases in all other channels « must also vary. This is assured by the 
existence of branch cuts in the individual eigenphases (and the corresponding transition matrix 
elements <x| > which are also functions of E) which are in the complex energy plane much 
closer to the real axis than the resonance pole. These branch cuts do not occur in the complete 
S matrix and therefore do not have physical significance. The occurence of these branch cuts 
in the <x|\7> and yA) must be just such that all these branch cuts cancel out in the background 
term (2.16), which can be a slowly varying function of energy. A discussion of these singularities 
of the individual eigenphases is given in C. J. Goebel, K. W. McVoy: Phys. Rev. 164, 1932 (1967); 
H. A. Weidenmuller: Phys. Letters 24B, 441 (1967); and R. H. Dalitz, R. G. Moorhouse: Proc. 
Roy. Soc. A318, 279 (1970). 
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Figure 2.1 


In the case of elastic scattering 4 = n,, (T' = T, a’ = a), in the case of a 
reaction 7 = r # ny. If one wants to illustrate this by a diagram alalogous 
to Figure XVIII.1.2, one would draw Figure 2.1. 

The S-matrix for the double multichannel resonance is obtained by in- 
serting (2.6), (2.7) and (2.8) into (2.3):? 


<niSHEDn’> = ZL <i> cel Dela 4 ERS E, -E-iW,2 


it iT2<n\|kx> Cnxlla ew” (2.9) 
E, — E — iT y/2 
The sum over x extends over all channels including the resonating channels 
Kp and ky. 
The amplitude for elastic scattering connected to the S matrix by 
(XVI.1.48), or (XVI.2.8), is obtained from (2.9) with 7 = n' = ng after a 
short calculation using (2.1): 


pT(p) = e2"*#) 2T el Cnallke>|? is a0 x <ialltx? Ves 
Er — E — il p/2 Ex, — E — iT /2 


+ Yi <nalle><xlingye™ sin (x). (2.10) 


In the case of inelastic scattering from the initial state with quantum numbers 
ny’ = nq into the final state with quantum numbers y = r, the scattering 
amplitude (XVI.1.48) is obtained from (2.9) as 


aT AN ><KrllNa> 
(Ar) ei27(kR) R R RITA 
J PPT 1 (p) = ie —ir,/2 


ip ST yCr|lkx> Geuliupe ae” 
Ey — E — il ;/2 


Re 2 <rilx> <K linge? sin y(K). (2.11) 


Equations (2.9)-(2.11) are approximations to the same extent to which (2.6) 
and (2.7) and (XVIII.6.23) are approximations. They have been derived here 
from the ordinary Breit-Wigner approximation and the assumption that 
the resonance (or quasistationary state) has internal quantum numbers k 


2 We omit the index / in the phase shift 7, and 6, whenever possible. )' then always denotes the 
background phase of that partial wave in which the resonance occurs. 
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incompatible with the internal quantum numbers » of the initial and final 
state. Due to this derivation (2.9)-(2.11) contain the individual eigenphases 
y(x) though these eigenphases belonging to the intermediate quantum 
numbers « may be unphysical quantities. The physically observable quantities 
are the resonance terms and the total background term (summed over all x) 
and only these quantities should be compared with the experimental data. 


Single Multichannel Resonance 


Before we continue the discussion of a double resonance, we treat the simpler 
case that there is only one resonance with quantum number kr. Such a 
resonance we shall call a single multichannel resonance. This case is obtained 
from the above formulas (2.10) and (2.11) by omitting the second term in 
the bracket. 

It is customary to introduce the following notation: 


ry? = <nixot'?, T=Te. (2.12) 


At this point we use time-reversal invariance. If time-reversal invariance 
holds, then the unitary matrix <y||K> is orthogonal, 1.e., the matrix elements 
are real [Equation (2.1a)]. Thus for time-reversal-invariant interactions 
pe are real. 

Because of (2.1), the T’, fulfill 


= Sa (2.13) 
I, is called the partial width for the channel 7. With this notation the elastic 
scattering amplitude for a single resonance of quantum number xk is written 


Eye 


— pi2y(kr) 


and the reaction amplitude is given by: 
place 


Te — pi2r(kr) 
PPT, (p) = € E,-E-i 


5 + b(r), (2:15) 
where 
b(n) = ¥ <ul> CK linge? sin p(k). (2.16) 


which can be a slowly varying function of energy even if the individual y(x) 
vary rapidly. The representation (2.14), (2.15) is called the generalized 
Breit-Wigner approximation. These expressions are very similar to the 
corresponding expression (XVIII.6.26) for the one-channel resonance, ex- 
cept that the height of the multichannel resonance is smaller by the factor 


* If time-reversal invariance does not hold, F} ? is complex and one defines I, by, = |P} /?. 
We shall not discuss this case here. 
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<nallka>? or <qyllKg><rl|Kg>. The first factor represent the transition am- 
plitude (square root of the transition probability) for a transition from a 
state with quantum numbers ny, to a state with quantum numbers Kp and 
back to a state with quantum numbers 7, ; the second factor represents the 
transition amplitude from n, to Kg and from xg to r. Thus the smaller height 
is due to the inelasticity, i.e., the fact that the resonance contributes not only 
to the channel n, or r, but also to all the other channels 7 4 yn, ory # r. 

The cross sections corresponding to the amplitudes (2.15) will then not 
look much different from the expressions for the cross sections ae in 
Sections XVIII.6 and XVIIL.9. Using the abbreviations: 

Er, —E 


R = R" = Cy||kp><Kalinad, (2.18) 


where Kp are the quantum numbers of the resonance R, and n, the quantum 
numbers of the intial state A = (a, T), one can write the /th partial cross 
section o{4~” using (2. - for 7 = yn, and (2.15) for 4 = r with (2.16): 

2 


off" = n(2l + 1) >| 5, <n <egde™ sin ye) + e299 


Cat 


(2.19) 


After some calculation this can be written as 


of4°) = 4n(2] + N= joan 


tory (R + © Cll) <r nad sin? (%)(2¢ cot (x) — 2) 


(2.20) 


where b(n) is given by (2.16). This is the elastic cross section for 7 = n, and 
the reaction cross section for n = r. 

For an energy E far away from the resonance energy Ep, 1.¢., for |e| > 0, 
the second term in the bracket on the right-hand side of (2.20) is very small 
and vanishes for |€| + oo. The cross section is then given by 


f°" > Gay = 4n(2l + 1) \b(n) |? for Je] -> 00. (2.21) 


Thus we see that (2.20) is very similar to (XVIII.9.14): a single resonance in 
the many-channel case will appear as a structure above a gradually varying 
background. The shape of this structure will in general not be the same as 
the one given in Figure XVIII.9.2 and will depend upon the various back- 
ground phase shifts y. However, if these background phase shifts are, in the 
neighborhood of the resonance energy E ~ Eg, approximately the same 
modulo z for all quantum numbers x: 


yk) = y(E) = y(E), (2.22) 
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then one can write (using <y74|n4> = ¥. <nall> <Kllna> = 1, 
Rq? + 2g¢eg —-2+R 


qe” aa Ons) SF Rof{?® for Nn =a (2.23) 


e+ 
where q is again defined by 
= cot »(E) (2.24) 
and oP is given by 
‘ 1 
o(?®) — 47x(21 + 1) a sin? y. Gps) 


Equation (2.23) is, indeed, very similar to (XVIII.9.14) with (XVIII.9.6). 
The shape of a single resonance in the many-channel case will, therefore, 
be very similar though not identical to the shape depicted in Figure 
XVIII.9.2.** 

From (2.20) we see that the multichannel resonance shows up in the 
elastic channel, 7 = n4, as well as in all the reaction channels n = r, at the 
same energy Ep and with the same width T. The shape of the resonance 
depends upon the background phase shifts y(«). And even if (2.22) holds, 
the resonance with quantum number Ke will have different though similar 
shapes in the various channels n, because R in (2.23) depends in general 
upon y according to (2.18). 

However, if one has the particular case that the background for all quan- 
tum numbers k—the resonating kK = Kp and all the nonresonating x—is 
negligible, then one has 


1) te al, Dua 
p? (Ex — EY — 1/4" 


of RiA>M — 4n(21 + (2.26) 
Thus, the resonance without background appears in all channels as a Breit- 
Wigner resonance at the same energy and with the same width, but with 
different heights depending upon the value of the partial width T,,. 

Further, as 7, can really take any value, the same Breit- Wigner resonance 
also appears for any initial state. Thus, independent of what the initial and 
final states of a scattering process are, if only their transition amplitudes to 
the state with quantum numbers xk, are different from zero, the resonance will 
appear at the same energy and with the same width. This is, of course, what 
one must expect of a quasistationary state, which is entirely determined by its 
“quantum numbers” (Er, I, /, Kg) and independent of the experimental 
conditions.* 


** If all the background eigenphases are equal they can be energy independent cf. footnote 
on p. 2-4. 

* As we have remarked already in Section XVIII.6, the notion of the quasistationary state 
(Ex. I, 1. kp) has a meaning only up to a certain accuracy. If the energy dependence of T has 


to be taken into account, then the apparent position, EX®*'’*®, may be different for different 
experiments. 
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Experimental examples of single multichannel resonances are shown in 
Figure XIX.3.1. This figure shows the various resonances (“energy levels”) 
of Si?® as they occur in the processes 


Ay Al?’ p> Gil*)*= p + Al?’, 
B: Al?’ + p— (Si28)* + a + Mg”*, 
C: Mg** + a (Si?8)* = p + AP’, 
D: Mg?* + a — (Si28)* > a + Mg?*. 


(Si?®)-resonances are observed at the following energy values:° 2.75, 2.93, 
2.95, 3.00, 3.14, 3.20, 3.28 MeV. Fine discrepancies between the curves—like 
the double peak around 3.4 MeV in process B, which is matched only by a 
single peak in process C—can be explained in terms of differences in the 
energy resolution for the experiments. 

Let 1, denote the quantum numbers of the p-Al*” system, r the quantum 
numbers of «-Mg7*, and kz,, Kp,,... the quantum numbers of the various 
(Si?*)-resonances. The energy values of these resonances are sufficiently far 
apart that one can consider each separately as a single resonance and need 
not apply the considerations for a double resonance described below. 

We see that in general, each single resonance occurs in all four processes. 
However, whereas in the reactions B and C they usually occur as pure Breit- 
Wigner resonance, in the elastic processes A and D they have considerable 
background phases. For example, the resonance at 3.203 MeV, whose 
quantum number we may call Kp,, occurs in B and C as a bump and in A 
and D as a dip. This indicates that (y4||Kp,> as well as <r||Kp,> are different 
from zero, and that the sum over x in (2.20) is negligible for y = r but not 
negligible for 4 =, = (quantum numbers of the p-Al?’ system) and 
” = 4 =r = (quantum numbers of the a-Mg** system). A possible ex- 
planation for this effect is that the background phases )(x) are not negligible, 
but approximately independent of x, so that for the elastic processes some- 
thing like (2.23) holds. Under the same assumption one obtains for the 
inelastic processes, using <y|y4> = 0, 


K K : ' 
of-" wo, + SL (2.23’) 


so that in a reaction the resonance occurs as a pure Breit- Wigner above a 
possible background a,,. 


5 These are the energy values on the «-particle energy scale. The difference in the origin of the 
energy axis for the «-particle energy and the proton energy is 1.61 MeV, and the energy scales 
of the processes A, B and C, D have been adjusted by bringing each pair of resonance peaks of 
the reaction curves B and C into coincidence. 
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Double Multichannel Resonance 


We now return to the case of two resonating Ith partial waves: one with the 
internal quantum numbers xz and energy and width (Er, Iz), and the other 
with internal quantum numbers xy and resonance parameters (Ey, Ty). We 
start from (2.10) and (2.11) and consider simultaneously the cases of elastic 
scattering of (2.10) 7 =n, and of the reaction (2.11), 7 =r. We use the 
notation 


R = R" = (alike) Ceallta), 
X =X" = Caley) Cealinarernr 1. oe 


We shall always assume time-reversal invariance (2.1b), so that R is real 
and the phase of X is entirely determined by the difference of the back- 
ground phases. Equations (2.10) and (2.11) are then written 


(T’p/2)R" i (I',/2)X" 
seer ep eS ail 


pT(p) = e'?%«) (; + 6), (2.28) 
where b is given by (2.16). 

To simplify our following discussion we ignore the background term b. 
It will have effects similar to those discussed above for the single resonance 
and in Section XVIII.9. 

The cross section for the double multichannel resonance then becomes 
(if we assume for the sake of simplicity that X is real): 


(TR/4)R* a (1x/4)X? 
(By) ea (ee ee 


(Ue/4)0x2RX[(Ex — E)(Ex — E) + aes 
[(Ex ~~ E)? + (VR/4) (Ex — E)? + (x/4)1 7 


1 
of” = an(2l + 1) 5 ( 


(2.29) 


Thus the /th partial cross section is the sum of two Breit-Wigner cross 
sections, with different widths and different heights, and a third term (the 
interference term). A double resonance is characterized by four parameters 
(Eg, Tp; Ex, Fy) and the quantities R and X, which depend according to 
their definition (2.27) upon the final-state quantum numbers y and also 
upon the initial-state quantum numbers y,. Thus for one channel R and X 
may have the same sign, and for another channel they may have opposite 
signs. The shape of the cross section of a double resonance, therefore, varies 
with the channel quantum number. If in one channel it appears as a double 
hump, it may show up in another channel as one single broad hump. The 
most favorable circumstances for a double hump are R ~ X. Then the third 
term is (2.29) has two maxima, one near E = Ep and the other near E = Ey, 
which enhance the humps at these values that come from the two Breit- 
Wigner cross sections in (2.29). Then the double resonance will appear 
in the cross section as two maxima with different widths, provided the back- 
ground can be ignored. 
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To obtain an idea of the possible shapes of such a double resonance, let 
us consider some special cases. It is useful to introduce the new variable 


E E 
6=E+ was (2.30) 
and denote 
A —! Er aoe Ey. Can 
Then the cross section formula (2.29) is written as 
an RT? ya 
o(®) = 4n(21 + 1 2 + * 
? a fee — 6) + (TR/2 — (—A/2 — 6)? + (Py/2)? 
2RX(6? — (A/2)? + Fal y/4 gly (2.32) 
[(A/2 — &) + (Tr/2)7(—A/2 — &) + (,/2P7 1) 


The interesting situation arises when Ez is close but not identical to E, and 
if the widths of the two resonances have approximately the same magnitude 
as A = Ep — Ey. Therefore we shall consider the case 


Ip = Ty — A. (2:33,,) 


Note that in this case the third term in (2.32) has a zero and a minimum 
Gif sign R = sign X) or maximum (if sign R = —sign X). With (2.33) one 
obtains from (2.32) 


ii [? R2 
Sar 2) Dr Te (aaa 
x? 2RX&? 
* (-Ap = 6 + App” 84 ol (2.34) 


The term in the bracket of (2.34)—i.e., o® except for the factor 


jay 

4n(21 + 1) = — 

—is depicted in Figure 2.2 for various values of X (R = 1). Ignoring 
ef20(KR)— ("x X is connected to the coupling constants according to (2.27) by 


= nllkx> (KxllN4> 
<nllKe><Ke\INa> | 


The correspondence between the curves and the values of X is as given in 
the legend. We see that the cross section may look quite different for dif- 
ferent couplings between the intial and final channels and the resonance 
channel. For X = 1! one obtains a double peak with the maxima slightly 
below E = Ey (6 = —A) and slightly above E = Ep (6 = +A). For 
X = —1, the cross section has one single broad bump centered around 
E = 4(Eg + Ey). For X = 0 there is, of course, only one bump at E = Ep, 
representing the resonance R, as in this case the resonance X does not couple 
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of) 


Figure 2.2 Cross section for double resonance: the case (2.33,) for 
various values of X: 


Symbol: * + V A O 
xe 1.0 0.5 0.0 —0.5 —10 
to the channels 7 or n4. But also for X = —O.5 there is only one single 


bump, which however does not have the form of a Breit-Wigner resonance. 
Thus, for different final channels 7, a double resonance may show up in the 
cross section as a double peak, as a broad single peak with maximum between 
the two resonance energies, or as a single peak around one of the resonance 
energies. 

The cross sections in 1 Figure 2.2 correspond to the most symmetrical case, 
(2.33,). If Py or Tp differ from A the picture changes slightly. Figure 2.3 
depicts (2.32) [except for the factor 42(2! + 1)(1/p’)I'*] for the case 


fA. Sie (2335) 


The variation in the shape of the cross section with the coupling between 
the channels is similar to that in Figure 2.2. 

In the above considerations we have ignored the background term b, so 
that the above discussions are valid only if (2.16) vanishes. Also we have 
ignored y(ky) — y(Kr). From our discussions in Section XVIIL.9, and from 
the footnote on p. 466 we know already that the background phase shifts 
may be large, so that this idealization may well be inadmissable. Taking the 
interference between resonances into account will make the complicated 
features of a double resonance even more complicated. Taking in addition 
the background from other partial waves and the resolution of the experi- 
mental apparatus into account, one concludes that the observation of a 
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Figure 2.3. Cross section for double resonance: the case (2.332). 
Symbols as in Figure 2.2. 


double multichannel resonance as a double peak in the cross section is an 
unlikely possibility. Thus two resonances with different quantum numbers kp 
and ky but with energies Ep and Ey, that differ from each other by an amount 
comparable with the width and that both couple to the initial and final 
channel, will be very hard to observe. 

Nonetheless, examples of double resonances have been seen in nuclear 
scattering processes. The most prominent examples are the Be® energy 
levels with (angular momentum)?*"'Y = /? = 2*. These resonances have 
been observed in the processes 


He* + He* > (Be®)* > He* + Het (2.35,) 
and 
He* + He* — (Be®)* — Be® + y, (2.35) 


at excitation energies of 16.6 and 16.9 MeV of the Be®, with widths of ap- 
proximately 110 and 80 keV, respectively. We do not want to describe the 
details of this system, and shall just establish the connection with the quanti- 
ties of our formulas (2.28) and (2.29). We ignore the background phases; 
then (2.28) can be written 


a0 x<nallkx> <Kxlln4> 2ER<nallKr> (Kp Ina> 


Ana — LE 
pT ee en OD. Reese — ie 


(2.36) 


476 Resonances in Multichannel Systems 


Corrected counts 


17.42 17.22 17.02 1682 1662 1642 16.22 
Exitation energy in MeV of Be® 


Figure 2.4 Fits of the doublet structure in Be® in the He*-He* 
channel. Note that the energy decreases from left to right. [From 
W. D. Callender et al.’ with permission. ] 


and 


51 y<n\lkx> CKx||N a> WT r<nilke><Kellna> 
E, —E -iWf,/2 E, — E — iF p/2 


Here |74> denotes the internal state of the (He*, He*) system, |7> denotes 
the state of the (Be®, y) system, and |x,> and |p» denote the internal state 
of the Be** (16.6 MeV) and Be®* (16.9 MeV) respectively, so that (2.36,) is 
the scattering amplitude for the process (2.35;). Depending upon the ratio 
<nallkx>?/<nallkr>’, the cross section for the process (2.35,) will then have 
the shape of one of the curves in Figure 2.2 or 2.3. The cross sections have 
been measured®:’ and the experimental data points together with a fit to 
the data are plotted in Figure 2.4.’ 


pTi "= (2.362) 


° A. D. Bacher, F. G. Resmini, H. E. Conzett, R. deSwinairski, H. Meiner, J. Ernst, Phys. Rev. 
Letters 29, 1331 (1972). 


7W. D. Callender, C. P. Browne, Phys. Rev. 2, 1 (1970). This figure does not give the cross 
section for the formation process (2.35,) but for the production process 


BI° + d—+a+ Be®* 


ie ake 
R and X in (2.29) are therefore not 


R = Cnallta><Keling>, X = (NAllKX)<KX||\N AD, 
but 
R = gr<Kr\lna>, X = gx¢KxllN4> 
with 


Gr * Gx: 
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Yield (arbitrary units) 


16.5 166 16.7 168 169 17.0 
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Figure 2.5 Fits of the double structure in Be® in the Be’-y channel. 
[From Nathan et al.®: Phys. Rev. Lett. 35, 1137 (1975), with permission. ] 


The curve in the upper plot shows a fit to the data obtained with an ex- 
pression (2.29) for a double resonance. The lower plot illustrates the fit 
with the sum of two Breit-Wigner cross sections, 


a , A; 
(Ey — FE)? + (1 x/2 (Eg — E)? + (FR/2)?’ 


which would hold if the states at 16.6 and 16.9 MeV were unrelated single 
resonances. This fit with two noninterfering Breit- Wigner curves is clearly 
unacceptable, whereas the fit in the upper plot with a double resonance (two 
interfering Breit-Wigner curves) is excellent. Thus we conclude that the 
double-peak structure observed around 16.6 and 16.9 MeV of the Be®* 
system is a double resonance. Both bumps have approximately the same 
height, the bump at 16.9 MeV is slightly narrower, and the fitted curve re- 
sembles the curve (marked x ) in Figure 2.3 for X/R ~ 1. Both states |Kp> 
and |xky> couple, therefore, with approximately the same strength to the 
incident channel: 


(2.37) 


Cm 


<Kxllna> © Kerlin. (2.38) 


Figure 2.5 shows a plot of the experimental data® for the reaction (2.35)). 
The solid line is a fit with (2.29); the dashed line is a fit with (2.37). Again the 
sum of two noninterfering Breit- Wigner curves is clearly ruled out, and a 
double resonance gives a very good fit. This time, however the curve re- 
sembles more the curve (marked +) in Figure 2.3 for X/R ~ 0.5. From 


8 A.M. Nathan, G. T. Garvey, P. Paul, E. K. Warburton, Phys. Rev. Letters 35, 1137 (1975). 


478 Resonances in Multichannel Systems 


this and (2.38) we conclude that the coupling of the two |kr> and |ky> to 
the outgoing channel (Be®, 7) fulfills the relation 


K<nllkx>| < l<nllke? |. (2.39) 


The experimental data for the Be® system around 16.6 and 16.9 MeV give 
a beautiful illustration of the fact that a double resonance may show up in 
the cross section with a variety of shapes. 


XX.3 Argand Diagrams for Inelastic Resonances 


In Section XVIII.8 we discussed the behavior of pT,(p) in the Argand diagram 
for the case of purely elastic scattering. In the second part of that section 
we also described how T,(p) can be obtained from the experimental data. 
In the present section we shall give a brief discussion of the behavior of the 
elastic partial-wave amplitude p7,(p) in the presence of inelastic processes 
and of the partial-wave reaction amplitude ./p,pT,, (p). The determination 
of these amplitudes from the experimental data follows essentially the same 
procedure as described in the second part of the Section XVIII.8: for the 
determination of T;(p) one uses the elastic differential cross sections, and for 
T,.(p) the differential cross section for the reaction (initial state 44) —> 
(final state r). 

We will now assume that there is a single multichannel resonance and 
discuss how this single resonance will show up in the Argand diagrams for 
the elastic scattering amplitude for the reaction amplitude. We first treat 
the idealized case without background. The scattering amplitudes are then 
given according to (2.14) and (2.15) by 


1 
3. 
oo) —————— 
e | (P) E, -E—irp (3.1) 
Iplepi/2 
TR) = ae ; ; 


Here p is the initial momentum and p, is the final momentum, which is a 
function of the initial momentum and the difference of internal energies in 
the initial and final states.” 

Ir, and I’, , are the partial widths (2.12) for the reaction channel r and for 
the initial channel n,, respectively. Ep and Tz = TF are the energy and total 
width of the resonance. We use again the abbreviation 


¢ = (Ex — B= (33) 


’ For a massive nonrelativistic projectile of mass m, before and mass m, after the reaction, 
P, = Pp is given by the formula (XVI.1.24). 
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and also define 


ve = <n|lka> = V/T,/T. (3.4) 


Ves may be negative. 
Equations (3.1) and (3.2) can then be written 
(R) _ Xn 
pT; = ? (3:5) 


Vb.pT i =X, (3.6) 


Xn, = <NallkR>? is called the elasticity of the resonance and is the transi- 
tion probability for a transition from a state with quantum numbers kp into 
a state with quantum numbers n, at a particular value of E. Thus Xy, Tep- 
resents the “fraction” of the resonance with internal quantum numbers kp 
that is coupled to the elastic channel 4,. For X,, = 1 the resonance couples 
to the elastic channel only, all x, with 7 4 44 are zero, and pT‘*) describes 
the unitary circle for an elastic resonance as described in Section XVIIL8. 
The elasticity x,, is related to the inelasticity coefficient 7,(E) defined in 
(XVI.2.10). At the resonance energy this relation is n(E) = 2x,, — 1. In 
contrast to n(F£), which has a strong energy dependence, empirical data 
show that the elasticity x, , is only very weakly energy-dependent and is 
therefore a more convenient parameter. 

The Argand diagram of (3.5) is depicted in Figure 3.1.'° It is a circle of 
diameter x, , lying inside the unitary circle. The resonance energy, ¢ = 0, 
lies on the top of the circle, and the points « = £1 correspond to E = 
Er + I/2. One has to distinguish two cases: 


(a) x,, = 0.5; then the resonance occurs at a phase-shift value 
Oy" (Ep) = 1/2. 
(b) x,, < 0.5; then the phase shift at resonance is 6"4(E) = 0. 
However, there is no qualitative difference between these two cases; in both 


cases the resonant cigen-phase-shift 6“"'(E) goes through 2/2 at E = Ep. 
The Argand diagram for the resonant reaction amplitude (3.6) is depicted 


in Figure 3.2. For the diameter of the circle one takes (eae |. It lies 
inside the circle of Figure XVI.3.2 for the reaction amplitude, and 


x <4. 


rXna 


If the background is included, the elastic scattering amplitude (2.14) is 
written 


pT, = e!?” Sane anh (3.7) 


10 Figures 3.1, 3.2, and 3.3 are from Barbaro-Galtieri (1968). 
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Figure 3.1 The resonant amplitude for the elastic channel is shown for 
two different values of the elasticity x, ,: (a) x,,, = 0.75 and the phase 
shift for the amplitude goes through 90° at the resonant energy; (b) 

X,, = 0.4 and 6 = 0° at the resonant energy. Notice that the phase of 
the resonant amplitude in the resonance circle goes through 90° in both 
cases. [From Barbaro-Galtieri (1968), with permission. ] 


Figure 3.2 The resonant amplitude for scattering into an inelastic 
channel. [From Barbaro-Galtieri (1968), with permission. | 
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Figure 3.3 Elastic scattering amplitude for the case of a resonance and 
a background amplitude. If the background is assumed to be constant 
throughout the resonance region, the amplitude pTis on the circle with 
center C’. In this figure the background amplitude is not purely elastic, 
but presents some absorption. [From Barbaro-Galtieri (1968), with 
permission. | 


The graphical representation of (3.7) for y = 0 is obtained from the 
graphical representation of (3.5) in Figure 3.1 by displacing the circle by 
the vector b and starting the resonance circle at the endpoint B of b. The 
resonance energy is then at the top of this resonance circle. The effect of the 
background phase shift y is then a rotation of the resonance circle by the 
angle 2y. The resulting Argand diagram for (3.7) is depicted in Figure 3.3. 

The Argand diagrams depicted in the figures of this section and also in the 
figures of Section XVIIL8 are idealizations. The energy dependence of the 
background b(E), the background phase shift y(E), and the elasticity dis- 
torts the circular behavior of the amplitude. An example of an experimental 
Argand diagram, obtained from the experimental data by a procedure 
described in the second part of Section XVIIL8, is depicted in Figure 3.4 
below. 

The existence and nature of a resonance in the Argand diagram are then 
determined by comparison with the idealized cases depicted in the above 
figures. If there is an indication of a circle, one takes this as a sign for a 
resonance. The resonance energy Ep is then determined as that value of E 
at which p7,(E) and the phase shift 6(E) change most rapidly. Around Er 
the path pT,(E) may show as a segment of a circle, and the elasticity x, , is 
then obtained as the radius of this circle. The total width may then be 
obtained from the energy difference between the points 4 circle to the left 
and + circle to the right of the resonance energy on the circle segment. In 
this way Argand diagrams have been of great help for displaying resonance 
behavior and for determining the resonance parameters. 
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(~ 1680) 


(~ 1550 and 1715) 


Figure 3.4 Experimental Argand diagrams for some pion-nucleon 
partial-wave amplitudes, [From A. H. Rosenfeld et al., Data for 
Elementary Particle Phpsics, 1965, with permission. | 


As an illustration, we depict in Figure 3.4 Argand diagrams obtained from 
the experimental data for some partial-wave amplitudes in z-p elastic 
scattering.'’ Each diagram shows the unitary circle and, inside it, pT7,(E) 
for various partial waves. (For 2-p scattering there are several partial waves 
for each value of /, characterized by isospin J and total angular momentum 
j = 1+ 4; we cannot discuss these things here, they are also of no relevance 
for our present discussion.) The numbers at the curves for pT(E) indicate 
the energy of the z-p system in MeV = 10° eV. The second diagram (labeled 
by P33) shows the almost purely elastic resonance A at the energy 1236 MeV. 
The Argand diagram is almost exactly as described in Figure XVIIL8.1. 
The third diagram (labeled $,,) is the Argand diagram of another partial 
wave with two resonances, one around 1550 MeV and the other around 


'! zt stands for z-meson and p for proton. 
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1715 MeV, both inelastic. The first diagram (labeled S;,) shows another 
case of an inelastic resonance around 1680 MeV. We see from these examples 
that the behavior of different partial-wave amplitudes is quite distinct and 
in general far from the idealized behavior. The partial wave in the second 
diagram, P,,, is in fact an exception; all other resonances are inelastic and 
have a large background term. 


CHAPTER XX] 


The Decay of Unstable Physical 
Systems 


In Section XXI.1 the decaying state is introduced as a resonance for which 
the production process is ignored. Section XXI.2 gives a heuristic discus- 
sion of decay probability (decay rate) and its measurement. Section XXI.3 
describes a decaying state by a generalized eigenvector of a Hermitian energy 
operator with complex eigenvalue. Using this novel concept, the calculation 
of the decay rate is very simple and is given in Section XXI.4. Section XXI.5 
contains a discussion of the partial decay rates and the use of various basis 
systems for their calculation. 


XXI.1 Introduction 


In Chapters XVIII and XX we discussed scattering processes in which an 
unstable physical system was formed as a quasistationary intermediate 
state. The measurements were performed at the initial state, which was 
prepared in the distant past, and at the final state, which was observed when 
the interaction was no longer in effect and the quasistationary state had 
ceased to exist. We have already seen in Chapter XX that the property of 
the quasistationary state is—at least to a high degree of accuracy—indepen- 
dent of the process in which it is observed. Its characteristics are the energy 
Ep, the width I, and the internal quantum numbers kg. 
The formation process 


a+T—>R-a+T (1.1) 
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Initial state Final state 
with quantum Resonance with Ex, I, kp with quantum 
number n, _ hfumber r 
Figure 1.1 


in which the quasistationary state is detected is depicted by the diagram in 
Figure 1.1, and (ignoring background) is described according to (XX.2.15) 
by the scattering amplitude 
| By 72 
PPT = Celi) gs Sal) (1.2) 
If I is sufficiently small and the delay time according to (XVIII.6.36) and 
(XVIIL.6.43) sufficiently large, one may ignore the formation process. One 
then starts the description at a given time, t = 0, with a physical state 
W(t = 0) = W®, and describes only that part of the process of Figure 1.1 
which is depicted by Figure 1.2. R is then considered as a decaying physical 
system or a decaying state of a physical system. 
Examples of decaying states are numerous in physics; “real” stable states 
are very rare. The radiative transitions of excited atoms or molecules A* 


to lower states A, 
A*+>A+y, (1.3a) 


are typical examples of decay processes. The unstable physical system had, 
of course, first to be formed, and this may have happened by a process like 


y+ A A*, (1.3b) 


But for the excited states of atoms one ignores the mode of formation and 
starts with an initial state W® describing the decaying system A*. 

The crucial problem is thus the choice of W* at the arbitrarily chosen 
initial time t = 0. From W®, if the time development is described by a 
Hamiltonian H according to the basic assumption V of Chapter XII, one 
can then obtain the state of the decaying system at any later time t by 


W(t) = eHtwRet itt, (1.4) 


Equation (1.4) is valid if the decaying system is isolated and evolves un- 
disturbed. Whether for a realistic decay process the decaying system really 
evolves undisturbed is questionable, because it almost unavoidably inter- 
acts with its surroundings, especially when the decay process is observed.’ 
Therefore the decaying system may be subjected to measurement processes 
occurring at random times. For example, a decaying particle in a bubble 
chamber leaves a track of bubbles caused by the interaction of the decaying 
particle with the environment. Each bubble means a measurement in which 


' A Beskow, J. S. Nilsson, Arkiv for Fysik 34, 561 (1967). 
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R with Eyl’ 


Figure 1.2 


the system has been found undecayed. Such a measurement at time f, 
changes the state, according to the basic assumption IIIb, into the state 


W'(t,) = ApW(t1)Ab, (lS) 


where A, is the projection operator on the space of states of the undecayed 
system or a subspace of it. Thus in addition to the change of state (1.4) one 
has to consider the randomly occurring change of state (1.5). 

Another problem is the existence of various possibilities for the choice 
of W® and their effect on the decay law. A further point that is worth 
discussing is the detailed decay properties in particular the deviations from 
the exponential law at small and large times as they follow from less idealised 
assumptions for H and W® than those we shall make here. Such finer de- 
scription will then also require to take the effect of the environment—as 
described by (1.5)—into account, which will lead back to an exponential 
decay low."* 

All these problems we shall not discuss here.'* In the following we shall 
consider only an approximate description, which will quickly lead to the 
desired results without regard for the fine points. For small width (Er/T > oo) 
this will provide a sufficiently accurate description. 


XX.2 Lifetime and Decay Rate 


In a decay experiment one has an ensemble of unstable physical systems R 
in a volume ¥, which is surrounded by counters that detect the decay 
products a’. The counting rate, i.e., the number of decays per second, N/T, 
is proportional to the number of unstable particles, Ng = pr¥ . The pro- 
portionality constant 


N/T 


A= Ne (2.1) 


is called the initial decay rate, or often just decay rate. 


12 For a review of these subjects the reader is referred to “ Decay Theory of Unstable Quantum 
Systems” by L. Fonda, G. C. Ghirardi and A. Rimini in Reports on Progress in Physics 41, 587 
(1978). Further methods not discussed in this review can be found in A. P. Grecos: “Solvable 
Models for Unstable States in Quantum Physics” Advances in Chemical Physics (1978) and 
A. George. F. Henin, F. Mayne, |. Prigogine: * Particles and Dissipations”, Hadron Journal, 
Vol. 1 (1978). 
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The number of decays per second, N/T, is equal to the rate of decrease 
of the number of unstable particles R, dN /dt. Therefore, by (2.1), 


dN 
- a (t) = ANg(t), (2.2) 


at least at t = 0. And if there is no further source or sink of unstable particles 
R in ¥, one is led from (2.2) to the exponential decay law 


N,(t) = Na(O)e~“, (2.3) 


where N,(0) is the number of unstable particles R at the time f= 0 when 
the decay process started. 

As N (0)}= —(Zo dN,(t), it follows that —dNp(t)/N,(0) is the proba- 
bility of a decay at time ¢ during the time interval dz. The decay rate, i.e., 
the probability per unit time for a decay at ¢, is therefore 


dP’ 1 dN,(t) 
= At) = — ———, 
dt FS x0) at G4) 
From (2.2) it then follows that 
Pt) | <9 = A, (25) 
which explains the name “initial decay rate” for A. 
The average lifetime of an unstable particle R, 
‘= i 1 PME) dt, (2.6) 
(0) 
is called the mean life, or just the lifetime. If (2.3) holds, then 
=e i a N,(O)e~ “At dt = ae (2.7) 
Jo Nx(0) * A Ft =0)' . 


In quantum physics the decaying particles are quantum physical systems 
and the observed quantities are probabilities. The observables measured on 
the decaying state W(t) may be the “property” of being in the original 
resonance state W®. The expectation value of W® in the state W(t) = 
eM WRellt represents the probability that R has not decayed. It is called 
the nondecay probability: 

P At) = Tr(W* W(t). (2.3a) 


It corresponds to the classical expression N p(t)/N p(0). 

Another observable that may be measured is the projection operator A 
on the space of physical states of the decay products (a’, T’), i.e., the property 
A. The expectation value of A in the state W(t) represents the probability 
for observing a’ and T’. This transition probability from the state W(t) 
to A, 


A(t) = <A), = Tr(AW(t)), (2.8) 
is called the decay probability of R into (a’, T’). 
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The quantum-mechanical decay rate is the transition rate from W(t) 

into A: 

dP . d 

<= A) =7 Me 
In (XIV.2.18) we have already calculated an expression for it, which we 
shall use below. 

The experimentally measured quantity is the lifetime t or the decay rate 
4. For large lifetimes (from | sec to 1 year), one measures N a(t) as a function 
of t and obtains J from (2.3). For shorter lifetimes down to 10°? sec one 
measures the lifetime by electronic methods. The decay to be studied must 
be preceded by another event, e.g., the formation of the unstable system, 
which is used to establish the origin of time. A counter is activated by this 
event and remains activated for a time t. The probability for the decay during 
this time is, according to (2.4) and (2.3), 


Number of times the activated 
counter detects a decay 


k } dt' dAe~** = kK — Dy 
ty) Number of times the counter 
is activated (2.9) 


One measures the ratio on the right-hand side as a function of t. As k is a 
constant independent of t [connected with N,(0) and the detector efficiency ], 
one can calculate J if one knows (2.9) for several values of f. 

For lifetimes shorter than 10°° sec one cannot directly measure t, but 
has to resort to indirect methods. As we shall see below, the decay rate is 
equal to the width. One then obtains the lifetime from the measurement of 
the width in resonance scattering. 


XXI.3 The Description of a Decaying State and the Exponential 
Decay Law 


We turn now to the quantum-mechanical calculation of the decay probability 
and decay rate. The first problem we have to face is the appropriate choice of 
W(0) = W®. 

Unstable systems are prepared by a scattering experiment in which the 
delay time is large. The cross section can then be fitted by a scattering 
amplitude consisting of a Breit-Wigner resonance term plus background. 
Intermediate physical states— which, according to Section II.10, are described 
by elements of ®—will represent such superpositions. It is often useful and 
sufficiently accurate to employ a description in which one isolates the inter- 
mediate quasistationary system, ignores its mode of formation, and represents 
it by an idealized decaying state. Such an idealized W(0) = W® —which need 
not be an operator in ®—must be chosen as if it was prepared before the 
decay interaction as an in-state W'"(t) which, at t = 0, under the influence of 
the interaction, has aquired a Breit~Wigner energy distribution. 
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Thus the density matrix? 
(E,4* |W(0)|E,d’*> = ¢E,4|Wi(0)|E,d’> 
= F(E, — Ep)p(E)'<a\|W* ||’) (3.1) 


must have a Lorentzian energy distribution function F(E, — Ep) with the 
resonance width as the width of the energy distribution: 


FE B= a OF 2 
Here we have normalized F(E — Ex) in such a way that? 
F(E — Er) > O(E — Ep) forT/2->0 (3.3) 
We shall also normalize the reduced matrix element to one: 
X <a||W®||a> = 1 (3.4) 


and discuss the normalization of W(0) = W®. We calculate 


Tr W(O) = ¥ <a" |WO)Ja*y =, | p(E) dE<EA* | W(O)|Ed*) 


=, [de FE ~ E,Xa|W"\a) 


IC pee 1 
on j aE CE Ey? + CD? 


il fp 1 
--{ dx —— 
(Se erate ey aie. al i 


ee arcian| Ee 3.5 
= \5 arctan RT} | | (3.5) 


R iP - ( pi 


For the sake of definiteness we have assumed that the spectrum of H and K 
is continuous and goes from 0 to «, excluding possible bound states. For 
physical reason the spectrum of H must be bounded from below, and we 
have chosen E,,;,, = 0. From (3.5) we see that for an infinitely narrow width, 
W(0) is normalized: 


dew * = Tr WO)e=1, (3.5a) 


2 The first equality of (3.1) has been proven in Appendix XV.A. 
3 See, e.g., Gelfand and Shilov (1964, Vol. 1, Chapter I, Section 2.1). 
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because arctan(Ep 2/T) > 2/2 for Eg 2/T > &. As Ep/T is large but finite, 
W(0) = W® is not quite normalized, but the error one makes is small. From 
now on we shall always work in this approximation, taking 


Ep/T > © for Er/T > 1 Gz 


and ignore the small error we make.* In this approximation the integral 
over the energy E is taken from —0o to + 0. 

The expectation value of the energy operator in an energy eigenstate 
W =|E )(Eo| is Tr(H|Eo)(Eol) = Eo. We shall now calculate the ex- 
pectation value of H in the state W(0) = W® given by (3.1) which is to 
describe a resonance with energy Ep and width T’: 


E 
24 ([/2)? 


ie 
Tr(HW®) = ¥ <al| WS |a> [a 2x (E — Ep) 


Er (” 1 1B pe De 
wee sae pooaee 
1 x= —EpR 2/T x ar 1 Z x= —Ep2/T Xa 1 


(3.8) 


In the limit (3.7) the first integral, according to (3.5), goes to x, and the 
second integral goes to zero, so that 


Tr(HW®) = Eg _ in the limit (3.7). (3.9) 


This is the result we desire from comparison with the stationary energy 
eigenstate. 

Equation (3.1) gives only the diagonal matrix elements of the statistical 
operator W® for a decaying system. To fix W* fully we also need to specify 
its matrix elements between generalized energy eigenvectors of different 
energy. These are obtained from (3.1), (3.2) and the requirement that W(t) 
describe a decaying state for t > 0 rather than a capture state (a state of 
exponential growth Corresponding to a pole in the second energy sheet 
immediately above the real positive axis). 

According to our discussion in Section II.11—in particular, according to 
(11.11.1)—W(0) should in general be given by 


Ww(0) = W® 
= 3 p(E,) dE, p(E,) dE, f(E, — Er)|E,d@’* > 
x <a |W < End ly “Geen! (3.10) 
Hence 
(Ed'* |W* | Ed*> = |f(E — Ex)|?<a@'|W* lad, (3.11) 


4 We should remind the reader that every theoretical description is an approximate de- 
scription and that the Breit- Wigner amplitude for resonance scattering may already be a crude 
approximation, as remarked in Section XVIII.6. Therefore the choice (3.1), (3.2) may result in 
greater inaccuracy than the approximation (3.7). 
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so that by comparison with (3.1) we must have 
Is 1 
2np(E) (E — Ex)’ + (1/2)? 


Except for an arbitrary factor of modulus 1, there are only two possibilities 


If(E — Ep)|’ = p(E)*F(E — Eg) = 


(3.12) 


from (3.12): 
ip 1 
es) \/2mp Ex +i 2—E Ge? 
and 
E — Ex) = : 3.14 
It ee Ei eE 


For a decaying state one has to choose (3.14). The property of a state 
(3.10) with (3.13) is discussed in Problem 1.** 

The integral in (3.10) should be extended over the spectrum of H; how- 
ever, in the approximation (3.7) we can take the integral from — 0 to oo. 
This will include generalized energy eigenvectors that do not belong to the 
spectrum of H; a mathematical remark concerning this point is given below. 

To simplify the notation we shall now specify the additional quantum 
numbers a and then ignore them. As discussed in the preceding chapters, 
the resonance R will have a definite value of angular momentum /| = /, and 
internal quantum numbers xk = kr. Therefore a choice of angular-mo- 
mentum eigenvectors 


a Welk > G5) 
is convenient. The reduced matrix element of W® in this basis is 
1 
<kll;| Wk’ I'T- 3) =a Oa Oup Ox: KR oy ‘Ir O11 aT (3.16) 


for a completely unpolarized state. W* of (3.10) is then 


p(E) dE I AE) dE’ f(E' — Ey) |Elg!,xt) 
x CBlalox fe = Epa G17) 


Varst: 


which we will write with (3.14) as 


Tr 1 
Rees 1/2 1/2 o 
Ww -x| (B)dE |p (E) dE rp TE 
I 
xe: mG: 
“IE XE _ pice @18) 


ignoring the additional quantum numbers. 


42 It describes a “‘capture state” and is associated with a pole in the third quadrant of the 
complex momentum-plane (Figure XVIII.5.3). Whereas the possible values of Fy + i 2 
(position of the poles) depend upon the particular properties (Hamiltonian) of the physical 
system under consideration, we know already from very general arguments (Section X VIII.5) 
that with every decaying state (Ep, I) the physical system must also have a capture state 
(Ez, —T). 
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A proper, stationary energy eigenstate of the observable H with eigen- 
value Ep is written as 
Ap = |Er)(Er|, (3.19) 


where 
H|Eg) = Ep|Ep) (3.20) 


and the |Eg) are proper eigenvectors and elements of the space of physical 


States: . 
CRE) — 1 (JEQEDC Hc O@"*). (3.21) 


As it is important to distinguish between the proper and generalized eigen- 
vectors, we shall here use a separate notation |E) for a proper eigenvector 
and |E)> for a generalized eigenvector (cf. Section II.8). We shall now show 
that W® of (3.18) can be written in a form analogous to (3.19). 


In order to do this we use a well-known mathematical theorem 
(the Titchmarsh theorem)° 
Let G(u) be the limit ofa function G(@), w = u + iv, that is analytic 
in the upper half plane and square-integrable over any line parallel 
to the real axis: 


+0 
sup | |G(u + iv)|? du < oo 


vo>0O 00 
Then 
1 + 00 G , 
G(w) = 7] AO opuneeO, (Cae 
~ OT Jee) =O) 
and 
Lf Glas 
G(w) = — wey dw’ for w real. (3.23) 
: Tl (ay = (60, 


Let us choose a “well-behaved” vector 6¢ ® <c # <®° with the 
property that <*E|@) is the limit of a function ¢*w|¢> = G(w) 
analytic in the upper half w-plane that fulfills the condition of the 
above theorem.** Then, writing w@ = Rp + iT /2, we obtain from 
(3.23) the connection 


o( Fs +15] (E, a) 


1 ae (a 1 
a = IE Wie E te 
i), EP (OCP |® aaa 


° See, e.g., Nussenzveig (1972, Chapter 1.6). 


2 This set of well behaved vectors is not dense in ® < .¥. In order to obtain all of ®, one has 
also to consider those @ € ® for which <w|@) is analytic in the lower half w-plane. # is then 
given by # = #,@® #-. where the realization of #, (#_) is the space of Hardy-class 
functions with respect to the upper (lower) half plane. 
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and the complex conjugate 


((Jelea)) (rea 56) 


=-5 | 7 dB pMUENGIE*) = — gp 629) 
For the latter we use the notation 
( o( Ex i is)) (6 De it) 
= (Vole a) 6) 
= 5 [ae p7EXIE*> Boao 6S 


If we omit the arbitrary “well-behaved” vector ¢, we can define 
the generalized vectors 


Ex 2 -) — Gal [ae p@1E) pe 
2 2ni \./p Ex, —il/2—E 
(3.27) 
and 
(Ep i it 7 - agi op JAE OE +12 —E 
(3.28) 


Applying the above theorem to <*E|H|@> (assuming that 
<*E|H|@)> also fulfills the conditions of the above theorem), we 
obtain 


(“Eq +i5| #16) 


a 1/2 ee : 
= aye | aE ECW rape 


ys ale 
Ex tis Exp +is 


and in the same way 


ie ale 
Ex—iz )= (ee -15)<e 


6) (3.29) 


(o|H Ep =i, ) (3.30) 
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Thus the |Eg — il’ /2*> are generalized eigenvectors of H with the 
eigenvalues Ep — il /2: 


a aL 
B- 15 )=(Ex- 15] 


DE ok a 
(E, = l 5 # = (k: + iS) En oP 5}: (3.32) 


We calculate the norm of this generalized eigenvector | Eg — i>: 


+ 
H aes > (3.31) 


2 


A 
Gag ene ) 
1 iv Sl + © 
= Bt p(Ex + is) iz dE’ p'/?(E’) dE p''7(E) 
1 
x CEE) eT 2-BE, +H 2—E 
Sal jess 1 
~ fag a(t / 5] i. (ae ee SPIE 


a 


7 i 
Ne 


5 


(333) 


On 


Though the generalized eigenvectors |E — iI/2*> are normaliz- 
able, the matrix element (Eg + iI /2|H"|Ep — iI */2> does not in 
general exist (for n even), as one can easily see by a calculation 
analogous to (3.33). 

Equation (3.33) tempts us to define the “normalized generalized 
eigenvectors with complex eigenvalue”: 


a . eee \* aie 
ER a is) = Qui)? ((» (Fs SF i5)) Er = ay (3.34) 
ir R Tr 1/2 Iee 


The generalized eigenvectors |Eg — if /2*)> do not, of course, 
appear in the spectral resolution 


(3.35) 


je [ome lE*><E)6> (ped) 


for the energy operators (cf. the mathematical note on the nuclear 
spectral theorem in Section II.8). They are elements of the space 
> which are not needed in the generalized basis vector expansion 
of elements ¢ of the space of physical states ®. (As we have seen 
above, their norm exists and they are even elements of #. How- 
ever, ||H"\Eg — iT /2)||? = (Ex + iT /2|H7"|Eg — if /2) does not 
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exist so that these generalized eigenvectors of H are not in the 
domain of H.° That these generalized eigenvectors are elements of 
H# is of no importance.) 

Usually when one constructs the rigged Hilbert space D c H < @* 
one tries to do this in such a way that these generalized eigenvectors 
do not appear in ®*; here we see that they may be useful for the 
description of decaying states.°* 


After these mathematical preparations we are ready to rewrite W® of (3.18). 
The integration in (3.18) extends over the semibounded spectrum of H. The 
integration in (3.27) and (3.28) extends from — oo to +00. Except for the 
small error of the approximation (3.7), W® is given by 


ai te T mad me 
ant. Jo(Ex -i5)o(Ex + #5] ao \ (Bett 


Er - i5) (« + iF (3.36) 
The approximate equality goes over into an exact equality in the limit 
Ep2/T — oo. Therefore, we define the ideal resonance state by the right- 
hand side of (3.36). Though this is not exactly a physically preparable state 
(as in all experimental situations, a resonance appears to be accompanied by 
a background), we shall from now on use this to describe a decaying state. 
For later references we now give the expression for W® with the additional 
quantum numbers 4 reinstated. Combining (3.10) and (3.36), one obtains 


wk 


Q 


(3.36a) 


nee me. 
Ez—i,.4 <a'||W*"\|a> Ex t+iz.d|. 


(2x0(ex-i")r)"") |e") 


OS ry ca = a a 
Le Jaro QE) -—paaE 62) 


es] 
b) 
>> 
a ae 
\| 


Ih 


© In the rigged Hilbert space ® c # < ®’ the Hermitian observable H is represented by 
the triplet of operators H < H = H+ c H” where H 1s essentially self-adjoint (and continuous 
with respect to the topology in ®), H is the closure of H and H * is the adjoint of Hin®*.A 
generalized eigenvector of H is the antilinear functional F,, = [> = |E — il’,2> such that 
F. (Hd) = <Hd|> = (o|H* |w> = w< |) for all @e®. This with the above ‘well- 
behaved”’ ¢ is the precise form of (3.31) (A. Bohm, The Rigged Hilbert Space and Quantum 
Mechanics [1978], Chapter IV, where the notation |@ » was used for the lw» here), and may 
clarify the ostensibly absurd statement that a generalized eigenvector of H is not in the domain 
of H. 

“In a different approach generalized eigenvectors with complex eigenvalue have already 
been used by H. Baumgartel in Resonances of Perturbed Self-adjoining Operators and their 
Eigenfunctionals, Math. Nachr. 75, 133 (1976). 
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For example, if the reduced matrix elements of WR are given by (3.16), 
corresponding to a resonance with angular momentum lp and internal 
quantum number Kp, then W* becomes 
1 r i 
R-y ———_ -—iz= ix Il 
WwW Lary ER I lalane) (E+ 15 R'3KrR 
Equation (3.36) is the form for the statistical operator of a decaying state 
with resonance parameters (Ep, I) that is analogous to the form (3.19) for a 
stationary state. W® = | Ey — iF/2)(Er — iI /2| is idempotent: 
Wwe = We, (3.37) 


This follows immediately from (3.33). 
From the form (3.36) of W*® the exponential decay law for the nondecay 
amplitude follows immediately. Using (1.4), (3.32), and (3.33), one obtains®? 


.  (3.36b) 


fort > 0: 
: ls j 
w(t) = eile Er ue 5) (E +4 io efit 
2 Z 
= e-HER-iT/2gHER+i0/2) | Ee — 5) (E as 5 (3.38) 


So in the approximation (3.7) 
WH=e Ww". (3.39) 


For the nondecay probability (2.3a) we then obtain, with (3.37) and 
(3.36), 


P(t) = Tr(WRW(t) = Tr(W®W)e™ =e "Tr We =e ™. (3.40) 


Equations (3.39) and (3.40) have been derived from (3.36), which entails the 
approximation (3.7).°° 

Comparing (3.40) with the classical decay law (2.3), we obtain for the 
initial decay rate and [from (2.7)] for the lifetime 


Vo 
t=5=5 (3.41) 


Thus the lifetime of a resonance with resonance parameters (Er, Tl) 
[fulfilling (3.7)], considered as an unstable physical system, is equal to the 
inverse width. 


6> Equation (3.38) is to be understood as <f|W(t)|d> = <e™'h|w><w* |e""b> for every 
ogeO # ,. Ina way analogous to (3.36) (3.38) one can define “capture states” of exponential 
growth using PED #H_. 


* As is well known to specialists, in the usual precise Hilbert space formulation one obtains 
deviations from the exponential decay law [L. Fonda et al. (1978)]. Deviations for large values of 
t follows from the condition that the spectrum of H be bounded from below which corresponds 
to a finite lower limit in the integrals (3.5), (3.8), (3.25), (3.26) and which is not the case if one uses 
the approximation (3.7). Deviations from the exponential law for small times ¢ follow from the 
condition that the energy in the decaying state be finite, which corresponds to the condition 
that the decaying state vector be in the domain of the Hilbert space operator H (or even in ®, 
the domain of H) which is not the case if one uses (3.27a). 
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XXI.4 Decay Rate 
A very quick but rather abstruse derivation’ of the initial decay rate A(0) 


is obtained if one inserts (3.36) into the expression (XIV.2.18), which we 
write, using (3.36a), 


At =0=-iDY fae p(E) dE’ p(E’)Xb| V|Ed* ) <E'd'* |V |b) 


S 1 - 1 Lae Jy 
a eos Reese. oma 


« (Es as iz. d|E'a’)<d'||WR\\d>. (4.1) 


As we have already taken the limit (3.7), we have to take « > I/2, so the 
present derivation is only true for narrow widths. 

One obtains immediately, using the completeness condition (XIV.2.11b) 
with (XTV.2.13), 


ic Ip 
At =0) = -—i1¥ V <bIVIER —- 1) (0 Bx nm | V |b) <a’|| Wa) 


b aa’ 


1 1 
- ~ al 4.2 
eae SEE a) 
From this we obtain with (XIV.5.20) 
Tr If 
P(e= 0) =2n> ) <bIV | Ep = i-, w) (a Ex, + i V|b> 


b da’ 2 


x O(Ep — E,)<a'||W* a). (4.3) 


The T-matrix (b|V|Ep — il /2,a) is a slowly varying function in the 
complex energy plane, so that for small values of I’/2 one can replace 
<b|V|Eg — i /2, a) by <b|V|Ep, a) and thus write for the initial decay 
rate 
At = 0) = 22) DY SE, — E)Xb| VIER, @)(Ex, a\V|b><a'||Wila>. (4.4) 


b aa’ 


If the summation extends over all possible decay products, then # is called 
the total decay rate. For the familiar case of a decaying state with angular 
momentum /, and internal quantum numbers kp [Equation (3.16)], this 
goes over into 


a> jee < Ey — Ep) Ol | Baa tae ts, Xa) l?- 
(4.5) 


as Lea 


7 A detailed derivation of the initial decay rate is suggested in Problem 2. See also A. Bohm, 
Rigged Hilbert Space and Decaying States. Univ. Texas preprint No. ORO-3992-353 (October) 
1978. 
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Equations (4.4) and (4.5) are formulas used for the calculation of decay 
rates in all branches of physics. Usually the interaction is weak, so that the 
Born approximation can be used for the transition matrix element 
(b|V|Eplglskp). This means |Egd) is replaced by the eigenvector of the 
free Hamiltonian 


oa (4.6) 


The |b» are already eigenvectors of K [cf. (XIV.2.14)]. So the program for 
the calculation of the initial decay rate is the following: One finds the eigen- 
states and generalized eigenvectors of the free-energy operator. These are 
the energy eigenvectors in the approximation in which the decaying system 
is considered stable. For instance, in the decay (1.3a), |Er, Ip, 13, Kp) will be 
the proper eigenvector of the energy operator of the atom when the interac- 
tion with the radiation is ignored and all energy levels are stable. |b> is the 
direct product 


|b> = ER, ie l;, Ke ) ® ly 


where the first factor is the ground-state energy eigenvector of the atom, 
and |y> is a basis vector of the one-photon space. Then one has to conjecture 
the interaction Hamiltonian V (or the transition operator T if the Born 
approximation is not satisfactory). For the case of the radiative decays of 
atoms this can be done to a certain extent by using the correspondence with 
a classical system; in other situations, e.g., for the decay of elementary 
particles, one has only very few general principles to limit the possibilities 
for this conjecture. After V and the solution of the free problem are known, 
one has to calculate the matrix elements of V between the free eigenvectors 
and insert it into (4.4) or (4.5). 

In this way the transition is considered as a transition from one “stable” 
state into another, caused by the interaction Hamiltonian V or by the 
transition operator TThe state of the decaying system Wis then 


AK = Y"|Epa)(GEgl, (4.7) 


which is orthogonal to the projection operator onto the space of decay 
products: 


OI, (4.8) 


AAR = 0. (4.9) 


Ais the projector on the space of physical states of all possible decay products. 
Let us then consider 


thea (4.10) 
and its expectation value in the state W(t), 
Tr(UIW(t)) = Tr(AW(t)) + Tr(A*® W(d), (4.11) 


which represents the probability of finding either the undecayed system or 
the decay products. At any time, Tr(/W(t)) = 1. Tr(AW(t)) is the transition 
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probability (2.8), and the nondecay probability (2.3a) is now Tr(A* W(t). 
Thus (4.11) says:” 


At) =1-P,(t)=1—e-™, (4.12) 


where we have used (3.40). The initial decay rate is then obtained from 
(4.12) as 


At =0)=T. (4.13) 


Thus the total decay rate of the decaying system is equal to the total 
width of the resonance. Therewith we have obtained the analogue to the 
classical relation (2.5), but we have also seen under what limitations upon 
the quantum-mechanical description we have obtained this classical re- 
lation. 


XXI.5 Partial Decay Rates 


Even more important in practical calculations than the total decay rate are 
the partial decay rates. In order to obtain a partial decay rate we have to 
specify the basis vectors |b) in (4.4) or (4.5). For the sake of definiteness we 
shall give our discussions first in terms of the angular-momentum basis. 
One can choose 


|b> = |E;, 6> = |E,, jism> (5.1) 
if the angular momentum commutes with the internal observables: 
[L;, 4°?] = 0. (5.1a) 


The internal observables 7°? will in general not agree with the internal ob- 
servables x°? of which the resonance is an eigenstate with eigenvalue Kp. 
We then write (4.5): 


r= ny fp Edn On Ey 


jaj ts 


ae rf I |<EjisnlErlglsip)|’. (5.2) 


This gives the rate for the decay of R into decay products with any of the 
internal quantum numbers y. If the detector detects only decay products 
with one particular value for 7, then one measures the quantity 


|< Ejisn| Vl Eglalskp)l’. 


G3) 


T(R — n) is called the partial decay rate for the decay of R into the state with 
internal quantum numbers n (decay channel n). Obviously one has 


Y T(R >) =P. (5.4) 


T(R -+ 9) = 2m | pX(E) dE HE ~ Fe) DY. 5 a 


72 Note that the first part of (4.12) and (4.11) has been derived from (4.10) and (4.9) which 
cannot be justified if one takes W® instead of A*. 
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Most often V is rotationally symmetric, so that only the term with j = I 
in (5.3) is different from zero. 

The weight function p"(E) is related to the normalization of the generalized 
eigenvectors | E,, j, j3>: 


CEjismlETisn'> = Sy (0(E)) HE — E')6j59j,3;- (5.5) 


Instead of the basis vector (5.1) one may use different basis vectors, e.g., 
the generalized momentum eigenvectors 


<py|p'’> = O(p — P)On 5 (5.6) 
if 
Gite Hee = 0. (5.6a) 
Then one has instead of (5.3) 


i 
I(R > n) = 20 {a o(E(p) — ye ara| K<pn|VEglelske)l°. (5.7) 


Instead of the momenta or angular momenta one can choose any other 
labels 


|E,, 5> = |E,, B, 2). (5.8) 
The only requirement is that 
[pPP, n°? ] = 0. 
For the basis vectors that one uses in the decay-rate formula one should 
always choose the most convenient ones, and which the most convenient 
ones are depends upon the particular problem.® Therefore we now discuss 


the general case. 
et 


be ne (5.9) 
be a complete system of commuting operators for the decaying system, 
where we have split a? = (x°", x°?) so that the decaying system has a definite 


value kK = Kp for the first set of operators and x°? have not been measured. 
The reduced matrix element is then 


1 
<a||W* a> = <xo\|W* || > = 0, Ong Oy ee (0,10) 
dim A(x) 
where #(K,p) is the eigenspace of x°? with eigenvalue Kp. 
Let 
Ke een Gp) 


be a complete system of commuting observables chosen so that the detector 
detects decay products with definite quantum number y and detects states 


* For instance, if it should. happen that (5.6a) is not fulfilled but that 
Pe 
commutes with 7°°, where M is the mass operator (which is a function of the internal quantum 
numbers), then one would choose the basis vectors |), 1>, where p; = p;/m(n). 
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with any values of f. Then one can ask for the partial decay rate of R into 
the decay channel y. Inserting (5.10) into (4.4) and using eigenvectors of 
(5.9) and (5.11), one obtains 


FS TR > 7); (5.12) 
where 


(Rn) = 2x | p%E) aE HCE — Eg) Y |CEnB|V Ege). (5.13) 
Bia 


Here 


= 1 
> ~ dim &(kp) Xd ot) 
means summing (or integrating) over all values of the labels for the basis 
vectors of the space of final states and averaging over all initial quantum 
numbers. One often states this as “summing over the initial states and averag- 
ing over the final states.” That is, however, a misleading statement, because 
the basis vectors | E, By > may have nothing to do with possible final states. 

The formula (5.13) for the decay rates has numerous applications in all 
branches of quantum physics. Whenever it makes sense to speak of a de- 
caying quantum-mechanical system, (5.13) with suitable chosen basis vectors 
can be applied. The interaction Hamiltonian or transition operator V 
depends, of course, upon the particular physical system that one considers 
and is determined by the algebra of observables. To find the right expression 
for V is one of the tasks of understanding the physical system. 


Problems 


1. Show that the state Wp given by (3.10) with (3.13) corresponds to a state of 
exponential growth (“capture” state). 
2. In this problem we suggest the calculations of the results in Section XXI.4 
without the use of (3.36) but in the approximation (3.7). 
(a) Define the quantity 
Fp (E, E') = YY pE\p(E)(Es)<E,b| V | Ea* > 


x (E'a'* |V\E,b><a||W*la’>(o(Er)) (P.0) 


Extend this function to complex values o,, w, w' of the energy and show that, 
as a consequence of the Hermiticity of V and time-reversal invariance, 


F (wo, 0') = Fa,(0, w’), (P.1a) 
F ,(w', 0) = F,,,(@, w’). (P.1b) 
Justify 
; : A mee 
Ey, bIVIEg + is, 4* ) = —( By, BIVIER — i554 (P.2) 


using the symmetry relation of the S-matrix, (eqn. XVIII.5.10). [(P.2) is needed 
for the calculations of the following problems]. 
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(b) Use (3.10) with (3.14) to show that the decay rate (XIV.2.18) can be written as 


At) = P,(t) + P,(t) (P.3) 
with 
7 F if a 
A(t) = =i ll dE, dE dE' #;,{E, E’) 
—iEt iE‘t 
ee ee ee ee 
E — E, + (1/2) E’ — Ep — i(T/2) E’ — E, — ie 
: r 
P(t) =i \ff dE, dE dE’ F,,(E, E’) 
TU 
—iEt iE't 
a ee denis IGE) 
E — Ey + i(T /2) E' — Ep — i(T /2) E — E, + ie 
and show 
P(t) = (P,(t))*. (P.6) 


(c) Use the approximation (3.7) and show, applying the theorem (3.23). that 
; ae Je 
P(t) = 4n*Te“"' 2 Re Fg, + erj2)-ie («, 7 Exgt+i 5): (E) 
(d) Show that from the condition A(t > 00) = 1 it follows that 


i ir 
Ane Re Mae eno (« Ss: i5) =, (P.8) 


and obtain (4.12) and (4.13). 


3. Assume that the detector has a very good energy resolution AE, < I. Then 
the decay rate per unit energy as a function of E, can be measured. 
(a) Using the same method as in Problem 2, show that this decay rate per unit 
energy is given by 
dP 


: ir 
os = =72 
gO =e 


F rilEp Lae 10, E, + iO). 
(b) Show that for the decaying angular-momentum state (3.16) the decay probability 
per unit energy is given by 
dA(t = 0) 1 
gue OC ae) ac KEyb|V|Eplnls kp)|?, 


where 
If 1 
2n (E; — Epy ae (T'/2)?" 


b(E,) is called the natural line width. 
Hint: Use (P.2) of problem 2. 


b(E,) = 


Epilogue 


The purpose of a physical theory is fulfilled if it provides a mathematical 
image of some domain of reality that allows us to relate experimental data 
and foresee new situations by making mathematical deductions. In this book 
we have restricted ourselves to this program and have never mentioned any 
of the further-reaching implications of quantum mechanics. Yet quantum 
mechanics has affected all scientific and even general human thinking. And 
when presented in its full generality, as done in this book, it effortlessly 
reveals the two facts whose lessons reach far beyond the boundaries of 
physics. 

Classical science is based on two assumptions: (1) the deterministic 
nature of predictions, and (2) the atomistic nature of understanding. Quan- 
tum mechanics teaches us to revise both. 

Classical theories are deterministic. The laws of classical physics are 
constructed in such a way that if the initial values of a system’s dynamical 
variables are given, then their precise values can be calculated for any later 
time. These laws were not simply derived solely from experience, and then 
accepted as the basis of scientific philosophy and general thinking. Rather, 
and perhaps to a greater degree, these laws have been extricated from nature 
because they were in accord with the prevailing philosophical idea that 
nothing can be without a cause. 

Probability statements in classical physics are always associated with 
insufficient knowledge, i.c., they are statements about the observer’s know- 
ledge and not about the physical system, which according to the principles 
of classical physics can be known to unlimited accuracy. In quantum theory 
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(as described in Section II.6 and elsewhere in the text) one can only say with 
what probability certain values can be expected, even if one knows the 
state as well as possible, i.e., even if the system is in a pure state. Thus in 
quantum theory statements are inherently probabilistic; the occurence of 
probability functions is not just a consequence of the observer’s insufficient 
knowledge, but a property attributed to the physical systems themselves. 
Quantum predictions of experimental results are statements of how a micro- 
physical process shows up in the macrophysical domain. These traces of 
microphysical processes in the macrophysical domain, the only source of 
human knowledge about such processes, do not obey deterministic laws. 
Earlier traces of a microphysical process do not determine later traces 
uniquely, but only probabilistically. Quantum theory teaches us that there 
are inherent limitations to human knowledge. 

The second point, that of the profoundly holistic nature (of the under- 
standing) of quantum physical systems, is not often emphasized, even though 
it is an obvious consequence of the quantum-mechanical description of 
physical systems. Although holism has already become rather widely ac- 
cepted in other disciplines (e.g., psychology), it has been resisted by the 
physicists, who seem to be influenced by the success of atomism in classical 
physics. The quantum physical system is a structured whole described by 
the mathematical structure of an algebra of operators. From the laws of 
the combination of quantum physical systems (Section III.5), it follows that 
there are observables—of the combination of the two subsystems (described 
by #, © Hz) that are incompatible with all observables of either subsystem 
(described by #, or #2). 

Thus, in quantum physics there exist holistic properties that cannot be 
obtained as combinations of the properties of the subsystems. In this sense 
the whole is not the sum of the parts. 

In the atomistic approach understanding comes from the reduction of 
the complex system to simpler subsystems by ever finer separations until 
one comes to the ultimate constituents. In quantum physics the presence of 
holistic properties prevents this reduction process, and the notion of ul- 
timate constituents loses its meaning. Atomism belongs to classical physics. 
A quantum physical system such as a molecule cannot be fully understood 
by dissecting it into nuclei and electrons, although, in the tradition of our 
scientific heritage, it is tempting to do this. What one arrives at in this way, 
however, is only the classical analogue of the quantum physical system, as 
in the Kepler system of proton and electron for the classical analogue of 
the hydrogen atom. An electron in an atom “is” something different from 
an electron in a linear accelerator, and the whole picture of the electron can 
only be displayed by giving its different aspects as they are mathematically 
described by the various basis systems in the space of physical states. 

The visual picture that one usually requires for the process of under- 
standing in is quantum physics not the geometrical picture of the object, 
but the picture of its image in the space of physical states. The reduction 
from the more complex to the simpler is performed not on the physical 
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object, leading to simpler consituent objects, but on the space of physical 
States, leading to the irreducible subspaces for ever simpler structures. At 
every stage of this reduction one still has a whole picture describing all 
aspects by the various basis systems of the subspaces, but a simpler one 
describing a narrower domain of physics. 

Dissecting a quantum physical system may destroy it. Therefore a quan- 
tum physical system (such as the CO molecule) cannot be understood only 
atomistically (as a di-atom) but is often more adequately understood 
holistically and functionally (as a vibrator rotator). 

Atomism has been a great achievement of the past, and all technology is 
based on it. But quantum theory has revealed its limitations and shown 
that even for simple systems a holistic method is needed also. 
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